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PREFACE 


It is by no means easy for the applied mathematician to decide 
how^ much importance he should attach to the more abstract and 
aesthetic side of his work and how much to the detailed applica- 
tions to physics, astronomy, engineering or the design of instru- 
ments. (Jreat mathematical ideas do not blossom in workshops, 
as a rule, but on tlie other Jiand the theorist should not divorce 
hiinself from a liealthy and intimate connection with practical 
questions. 

Sir William Rowan Hamilton (1805-1865) created a method in 
Geometrical Optics, which, after lying long in disuse, is at last 
finding its proper place in the science. To all appearances, 
Hamilton attached little importance to the practical applications 
of his method, and it was only with the publication of his Mathe- 
matical Papers, Vol. i (Cambridge, 1931), that it was possible to 
form a more (torreci and balanced judgment of Hamilton as an 
applied mathematician. Great indeed was the labour whic^h he 
employed with a view to applying his method to the design of 
o})tical instruments, but lor him the abstract and aesthetic side 
of his work was of so much greater public importance than its 
practical use that the details of application remained unpublished 
till long after his death and long after other workers had dis~ 
covered equivalent processes. 

Since it was left largely to those primarily interested in optical 
design to develop the subject of Geometrical Optics, it is only 
natural that the student of the subject soon finds himself im- 
mersed in details which tend to cloud his understanding of the 
underlying general principles. Now, just as it is widely recognized 
that in the teaching of Mechanics a middle course must be steered 
between a completely abstract presentation and a technical 
approach, so it seems to me that the student of Geometrical 
Optics is most likely to understand the principles of Hamilton’s 
method if he does not think too much at first of technical applica- 
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tions. But, at the same time, he should not be kept entirely 
remote from them. 

Since editing, in collaboration with Professor A. W. Conway, 
F.R.S., Hamilton’s papers on Geometrical Optics, I have had the 
opportunity of lecturing on the subject to graduate students and 
undergraduates in the University of Toronto. This book repre- 
sents a course of twenty -five lectures to the latter. Although the 
reader may fail to find in it some things which he would naturally 
expect in a book on Geometrical Optics, no apology is offered on 
that account. If Hamilton’s method is iinderstood, the book 
serves its purjjose. For that reason it is not jiecessary to defend 
the application of the method to problems which would admit 
shorter special solutions. 

Hamilton was a master of mathematical notation, and he 
might in this respect be profitably studied by some modern writers 
in our subject. I have employed his notation in the main , changing 
the signs of the W and T functions to make their physical inter- 
pretation more obvious, and making some changes in nomen- 
clature. It does not seem necessary or desirable to use the word 
“eikonal”, which Bruns invented in 1895 in ignorance of 
Hamilton’s work. Since one letter is just about as good as 
another, would it not be a harmless compliment to the genius 
of Hamilton for writers on Geometrical Optics to employ 
for the various characteristic functions the letters which he 
employed? 

Although Hamilton himself started by considering the simpler 
case of isotropic media, it was not long before he saw that his 
method was also applicable to anisotropic media, and when he 
came to give his theory final form in his Third Supplement, he 
did so in all generality. This has done much to discourage those 
interested in the more practical asjjects of his method, because in 
order to apply it they have been compelled to think in terms of 
(to them) unnecessary generality. To avoid a repetition of this 
error of policy, the theory of anisotropic media has been entii-ely 
omitted from this book. To compensate for this omission and for 
the fact that, although an attempt has been made to amplify 
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Hamilton’s work in the directions since found of most interest, 
these amplifications liave not been sufficient to create an adequate 
text-book, a brief bibliography is given below. In some of these 
works Hamilton’s characteristic functions are referred to as 
Bruns’ eikonals, but there is no significant difference. 

I have to thank three of my students, Messrs H. R. Roberts, 
P. R. Wallace and A. White, for assistance in the preparation of 
the manuscript, and my colleagues, Professor A. P. Stevenson 
and Dr B. A. Griffith, for reading the proofs and making valuable 
suggestions. It is also a pleasure to pay tribute to the skill and 
accuracy of the Cambridge University Press. 

J.L. S. 

Toronto 
October 1937 
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CHAPTER I 


THE PRINCIPLES OP GEOMETRICAL 
OPTICS FOR ORDINARY MEDIA 

1. The nature of geometrical optics. 

A ‘‘perfect” scientific theory may be described as one which 
proceeds logically from a few simple hypotheses to conclusions 
which are in complete agreement with observation, to within the 
limits of accuracy of observation. Rut the theory is useful” only 
in so far as it is possible to obtain conclusions from the hypotheses. 
As accuracy of observation increases, a theory ceases to be 
“perfect”: modifications are introduced, making the theory 
more complicated and less “useful”. Since we do not willingly 
surrender the wealth of approximate results furnished by the 
earher form of the theory, we find ourselves in the unsatisfactory 
position of using one theory for one problem and another for 
another, although the two problems really belong to the same 
part of science. To rescue ourselves from intellectual confusion, 
we may admit theories called “ideal”, in the sense that they deal 
with an ideal universe, resembling the actual universe to a fair 
degree of accuracy and usually corresponding to a limiting case 
of physical reality. 

A critical examination of the history of mathematical physics 
shows that in truth man has always created “ideal” theories. 
Nature is much too complicated to be considered otherwise than 
in a simplified or idealized form, and it is inevitable that this 
idealization should lead to discrepancies between theoretical 
prediction and observation. As examples we may mention the 
mechanical theories of rigid bodies and perfect fluids; neither 
rigid bodies nor perfect fluids exist in nature. Or we may think 
of the Newtonian theory of gravitation, long regarded as “per- 
fect”, but now “ideal”, physically replaced by the “perfect” 
(but not so “useful”) general theory of relativity. 

Geometrical optics is an ideal theory and a useful one. The 
discovery that the propagation of light is an electromagnetic 
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phenomenon made the subject of optics coextensive Avith electro- 
magnetism. We may, however, study certain parts of the subject 
of optics without reference to electromagnetism, always under- 
standing that there is a limit to the physical accuracy of the 
results so obtained. It is customary to use the name “physical 
optics” for the more complex and physically accurate theory, 
and “geometrical optics” for the simpler ideal theory with 
which we shall be concerned. It is possible to justify geometrical 
optics as a limiting case of physical optics, the wave-length of the 
light in question tending to zero; f but we shall be content with 
the development of geometrical optics on the basis of its own 
hypotheses, just as it is customary to develop the dynamics of 
rigid bodies as a separate theory, and not as a limiting case of the 
dynamics of elastic bodies whose elastic moduli tend to infinity. 

2. Fermat’s principle: laws of reflection and refraction. 

We consider the propagation of light through transparent 
media. We shall understand by an ordinary medium one which is 
homogenexma (the same at all points) and isotropic (the same for 
all directions) with respect to the propagation of light, deferring 
to Chapter v the discussion of media which are heterogeneous 
(like the atmosphere); anisotropic (crystalhne) media will not be 
discussed. 

Although the wave-length or frequency or colour of the light 
does not enter explicitly into the theory of geometrical optics, 
we admit that light is of various sorts. We shall consider separ- 
ately the propagation of lights of different colours, so that at any 
one time we shall be dealing with monochromatic light. 

In an ordinary medium light of a definite colour has a constant 
velocity of propagation v, different for different colours. In a 
vacuum, a particular case of an ordinary medium, the velocity of 
propagation is the same for all colours : it will be denoted by c. The 
index of refraction or refractive index of a medium is defined to be 

(2-1) p = cjv, 

so that p = 1 for a vacuum. For all media p 1. For most 
t M. Born, Optik (Berlin, 1933), p. 45. 
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practical purposes air may be treated as a vacuum, because its 
index of refraction differs very little from unity: for air, the 
index (for the sodium D-line) is 1*0003. 

If C is a curve passing through transparent media, joining 
points A' and A, the time which light would take to travel along 
G with velocity v would be 


(2*2) 


JA-V cJa' 


where ds is an element of the curve. Since v is constant in each 
of the media (supposed ordinary), this may also be written 


(2*3) 


< = 2 = 


1 ^ 
^ A' 


where there is one term of the summation for each medium 
traversed, s being the length of 0 contained in it. We define the 
optical length of G to be 

(2-4) [CJ = ct — j /ids = '^/is. 

We shall use square brackets to indicate optical lengths. 

We sliall now state the basic hypothesis of geometrical optics: 

Fermat's principle: When light travels from A' to A, it travels 
along a path or ray for which the time taken {or equivalently the 
optical length) has a stationary value with respect to infinitesimal 
variations of the path. 

Usually the time will be a maximum or minimum. 

Let us consider a single medium. Let A' and A be two points 
in it. Since the straight line joining A' and A has the shortest 
length of all possible curves joining these points, the optical 
length of this straight line (which only differs from the geometrical 
length by the constant factor /i) has a stationary value. Thus, 
in a single ordinary medium light travels in straight lines. 

It would however be wrong to suppose that light can travel 
from A' to A only along the straight line A' A. It may pass from 
A' to the boundary of the medium and thence be reflected back 
to A. We shall now deduce the law of reflection from Fermat’s 
principle. 
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Let light travel from to a surface 8 (Fig. 1) which bounds 
the medium in question, and hence back to ^4. 8 functions as a 
mirror. Let B be any point on 8, so that A' BA is in general an 
unnatural path (not a ray); its optical length is 
(2-5) [A' BA] = /ip' + /ip, 

where p' = A'B, p = BA. Let us take any rectangular axes of 
coordinates, and let the coordinates of the points be as follows: 
(2-6) A' -.x^y^z', A:x,y,z, B:x",y'',z''. 



Fig. 1 

Then 

(2-7) p'^ = Z{x'-x"f, p^ = i:{x-x"f, 

the 27 meaning a sum obtained by changing x->y->z. Giving to 
B an arbitrary infinitesimal displacement 8x", Sy", 8z", we have 
(2-8) p'8p' =:-2:{x'-x")8x", p8p = -S(x-x")8x", 

or if a', P', y' are the direction cosines of A'B and a, P, y those 
of BA, so that 

(2-9) a'p' = x"-x', P'p' = y"-y', y'p' = z"-z', 
ap = x-x", pp^y-y", yp = z-z", 

we have 

(2-10) 8p' = i:a'8x", 8p = -l'a8x". 
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Thus for an arbitrary infinitesimal displacement of B on 8, we 
have for the variation of the optical length 

(2-11) 8[A'BA]= ii8p'+ti8p 

= pS{a' — a) 8x'' . 

In order that this may vanish, as demanded by Fermat’s prin- 
ciple, for all arbitrary infinitesimal displacements of B on 8, the 
vector whose components are 

(2-12) a'-a, P'-fi, y' -y 

must be parallel to the normal to 8 at B, or equivalently 

(2-13) = 

I m n 


where I, m, n are the direction cosines of the normal to 8 
at 7i; wo shall take the normal drawn into the mirror as shown 
in Fig. 1. 

The angle of incidence {i') is the angle between the incident ray 
produced and the normal to 8, and the angle of reflection (i) is 
the angle between the reflected ray reversed and the normal. If 
we mark O' on the incident ray produced, and C on the reflected 
ray, making 

^ BC' = BG=\, 


then the coordinates of O' relative to B are (a', P' , y') and those 
of 0 relative to B are {a,p,y). Thus the vector (2*12) is the dis- 
placement CC'\ this is parallel to the normal at B. Hence it 
follows immediately that the law of reflection may be stated as 
follows : 

(i) the incident ray, the reflected ray and the normal to the 
mirror at the point of reflection are coplanar; 

(ii) the angle of incidence is equal to the angle of reflection 
(*■' = »)• 

It is easily seen that the common value of the fractions in (2’ 13) 
is 2cosi. 

The analytic expression (2-13) for the law of reflection enables 
us to determine the reflected ray when the directions of the 
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incident ray and the normal to the mirror at the point of incidence 
are known, for we have in (2*13) and the identity 

(2-14) a.^ + ^ + y^=\, 

three equations for the three unknowns a, /?, y. Since a quadratic 
equation occurs, there will be two solutions: the extraneous 
solution (to be rejected) is 

a = a', = y = y'. 



When a ray passes from a medium M' of index /i' across a 
surface of separation 8 into a medium M of index /*, the ray in 
general undergoes an abrupt change of direction on crossing 8, 
this phenomenon being known as refraction. Let us now in- 
vestigate the law of refraction on the basis of Fermat’s principle. 

Let A' be a point in M', A a point in M and B any point on 8 
(Fig. 2). Adopting the same notation aa that xised above for the 
case of reflection, we have for the optical length of the unnatural 
path A' BA 

(2-15) [A’BA]= fi'p’+/ip, p' = A'B, p = BA. 
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Then giving to B an arbitrary displacement 8x'' , 8y", 8z'' on S, 
we have 

(2-16) 8 [A' BA] = it’8p'+fi8p 

— p' Saf 8x'' — iiSa8x" 

= S {p'a! — pa) 8x" , 

and, since this is to vanish for the natural ray by Fermat’s 
principle, we have (as the analogue to (2*13)) 

(2- 17) -1^ ^ ^ f^'Y -1^7 

I m n ' 

If we lay off BC = p' along the incident ray produced, and 
BC = p along the refracted ray, the coordinates of C" "relative 
to B are {p'a',p'^',p'y'), and those of C relative to B are 
(pa,p^,py). Hence the numerators in (2-17) are the components 
of the. displacement CC', which is therefore parallel to the 
normal to S at B. 

The angle of incidence {i') is the angle between the incident ray 
produced and the normal to 8 (drawn from M' into M), and the 
angle of refraction (i) is the angle between the refracted ray and 
the normal to 8. We may state the law of refraction as follows: 

(i) the incident ray, the refracted ray and the normal to the 
refracting surface at the point of incidence are coplanar; 

(ii) the angle of incidence and the angle of refraction are con- 
nected by the relation 

(2‘18) p' sini' = psini. 

This last relation follows at once by equating the lengths of the 
perpendiculars dropped from C' and C on the normal to 8 at B. 
The common value of the fractions in (2- 17) is 

p' cos i' — p cos i. 

As in the case of (2-13) for reflection, (2-17) (with (2-14)) give the 
direction of the refracted ray when the directions of the incident 
ray and the normal at the point of incidence are assigned. There 
is an extraneous solution arising from the quadratic equation 
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involved. To see what it is, we choose special axes, the z-axis 
being normal to S at B, so that 

(2'19) I = m = 0, n = 1. 

Then (2- 17) reduce to 

(2'20) /la = /*'«', /i/d = /i'/d'. 

These determine a, /?; y is given by 
(2’21) y = 

Obviously we must have 7 > 0 : the extraneous solution corre- 
sponds to the negative radical in {2-21), and hence the direction 
given by the extraneous solution is the geometrical reflection in 
the tangent plane to S nt B of the true refracted ray. 

Under certain circumstances it is impossible for a ray from M' 
to be refracted into M. Then reflection only, and not refraction, 
can take place: this phenomenon is known as total reflection. It 
occurs when i cannot be found to satisfy (2' 18), that is, when 

(2-22) '^sini'>l. 

Obviously total reflection can take place only if /i' >/i. 

3. Normal and skew congruences: theorem of Malus. 

A system of curves filling a portion of space, and such that in 
general a single curve passes through any assigned point, is called 
a congruence. For example, the normals to a surface form a con- 
gruence. If we denote by a, /y, y the direction cosines of the tangent 
to the curve of the congruence at a point x, y, z, the con- 
gruence maybe defined by expressing a, /?, y as functions of x, y, z, 

(3-1) a=f{x,y,z), = g{x,y,z), y = h{x,y,z). 

These three functions are not independent; they must satisfy 
the identity 

(3-2) f^ + g^->rh^ — a^-\-^-\-y^= 1. 

If the curves which form the congruence are straight lines, the 
congruence is said to be a rectilinear congruence. The congruences 
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with which we have to deal in the geometrical optics of homo- 
geneous media are rectilinear. 

If there exists a singly infinite family of surfaces cut ortho- 
gonally by the curves of a congruence, the congruence is said to 
be normal ', if such a family does not exist, the congruence is said 
to be skew. 

Suppose there is a normal congruence of curves, defined as in 
(3'1). Let the equations of the normal surfaces be expressed in 
the form 


(3-3) F{x,y,z) = con?,t. 

The direction cosines of the normal to the surface of this family 
at a point x, y, z have the ratios 

dF dF dF 

I’herefore ' 

dF dF dF 

( 3 . 4 ) ■ 

where is a factor of proportionality. Differentiating, wo have 
d d^F d^F d 


and therefore 


dy 0/? 
dy dz 


dO ,, dO , 


Similarly, we obtain 


' Jda dy\ i 

\dz~'dxr'^'\ 


dp da 
dx dy 


n 

■a~^ = 0 . 
dy 


Multiplying these equations in order by a, /?, y, adding, .and 
dividing by 6, we obtain 

This condition is necessarily satisfied if the congruence is normal. 
Moreover it is known from the theory of total differential 
equationsf that if (3'7) is satisfied, then functions 0 and F of 
t Cf. H. T. H. Piaggio, Differential Equations (London, 1933), p. 140. 
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X, y, z exist such that (3’4) are true. In other words, the equation 

oi.dx + fidy + ydz = 0 

is integrable. Consequently (3*7), if true, implies the existence 
of a family of surfaces (3-3) to which the curves of the congruence 
are normal. Therefore {Z-1) is a necessary and sufficient condition 
tJuit a congruence be normal. 

As an example, it is easily verified that the congruence defined 
a = xjr, ^ = yjr, y = zjr, {r'^ = x^ + y^-\-z^), 
is a normal congruence. On the other hand, it may be shown 
that the congruence defined by 

^ OL — yjr, p = —xjr, y = zjr 

is a skew congruence. 

If we are given a family of surfaces 

F(x,y,z) = const.. 


there exists a normal congruence of which these surfaces are the 
normal surfaces. The curves of this congruence are called the 
orthogonal trajectories of the family of surfaces. The congruence 


has the equations 






where 



We shall now show that the system of straight lines normal to 


any assigned surface is a normal con- 
gruence. Let Sq (Fig. 3) be the given 
surface and let S be the surface formed 
by cutting off the same length s from all 
the normals to (Sq. This construction 
puts the points of Sq and S into one- 
to-one correspondence. Let A, B he 
the points of 8 corresponding to Aq, 
Bq on Sq, respectively, the distance 



Af^B^ being infinitesimal. Join Aq to Kg- 3 


B. Since BBq is normal to Sq at Bq, the infinitesimal displace- 
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merit is perpendicular to Hence 

AqB = BqB = 5 = AqA, 

to the first order of infinitesimals. Thus the infinitesimal triangle 
BAqA is isosceles to the first order, and therefore the angle 
BAAq is a right angle : thus AqA is normal to /S at ^ . Thus all the 
normals to are normals to 8, and this is true for any value of s. 
Hence the congruence of normals to Sq is a normal congruence. f 
It is obvious that the rays emanating from a point P in an 
ordinary medium form a normal rectilinear congruence, having 
for normal surfaces the family of spheres with centre P. The 
theorem of Malus asserts that a normal rectilinear congruence 
remains normal after reflection or refraction, and hence that the 
congruence formed by any number of refiections or refractions 
from the congruence of rays originally emanating from a point 
is a normal congruence. 

s 


Fig. 4 

To prove the theorem of Malus, let be a ray of the incident 

congruence, incident on the reflecting or refracting surface at P, 
and let P-4 be the reflected or refracted ray (Fig. 4). Let S' be the 
normal surface to the incident congruence (normal by hypothesis) 
at .4'; let P'be an adjacent point on S' and B'QB the ray through 
B', incident at Q. The point B is taken so that 

(3-10) [B'QB] = [.4'P^]. 

Joining B'P, PB,-we have to the first order, by Fermat’s principle, 
(3-11) [B'PB] = [B'QB], 

f The result also follows immediately from the equation 
i:{x - Xq) {8x - SXq) = 0, 

whore x, y, z are coordinates of A and x^^ Zq those of Aq. 
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Hence 

(3-12) [B'P] + [PB] = [A'P] + [PA\, 

but since A'B' is perpendicular to A'P, we have [-B'P] = {A'P], 
to the first order, and hence [PP] = [P-4]. Thus to the first 
order, PB = P-4, which shows that the infinitesimal displace- 
ment ^ P is perpendicular to P-4 . 

Now if a surface S is formed by taking points on the reflected 
or refracted rays so that the optical length from each point on S' 
to its correspondent on S is [-4'P-4], it follows from the result 
established above that P-4 is perpendicular to every infinitesimal 
displacement at A on S. Therefore PA is normal to 8 a,t A. Thus 
all the reflected or refracted rays are normal to S. By varying 
the position of A on the reflected or refracted ray from A' , we 
get a single infinity of siirfaces to which the final rays are normal. 
Thus the theorem of Malus is established. 

4. The construction of Huyghens. 

Throughout the history of the science of optics, two rival 
theories have developed side by side — the corpuscular theory and 
the wave theory. Tn the corpuscular theory the phenomenon of 
light is regarded as due to the motion of corpuscles (or quanta in 
modern language), which are individually localized in small regions 
of space, so that ideally they may be regarded as points. The 
tracks of these particles are the rays. Tn the wave theory, light is 
regarded as due to the propagation of a system of waves. At the 
present time it is impossible to be dogmatic concerning the cor- 
rectness of eitlier view. However, in geometrical optics it is possible 
to regard the two theories as different aspects of a single theory. 

To develop the wave theory of light in geometrical optics we 
follow the construction of Huyghens. Let Z' be a surface which 
represents a wave of disturbance in the medium at time i' . Let 
each point A' of Z' be regarded as the centre of a secondary 
disturbance which spreads out from A' in all directions. For an 
anisotropic medium it is necessary to distinguish between the 
ray -velocity and the wave-velocity, but we are here concerned only 
with ordinary media, in which it is assumed that the velocity of 
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propagation of the secondary waves is the same as the ray -velocity 
V introduced in § 2. 

Let us now consider the disturbance at time t\ we have then a 
number of secondary waves, each of radius — whose points 
fill a layer of space including the surface S'. These spheres have 
an envelope consisting of two sheets, one sheet on each side of S'. 
We assume that the wave S' has a sense of propagation to one side 
or the other, and we assume that the wave at time t is that sheet S 
of the envelope of the secondary weaves which is such that passage 
from S' to S is in the assumed sense of propagation. In Fig. 5, 



A' , B' are centres of secondary waves and A, B the points of 
contact of these waves with the envelope S. 

In the above statement it is assumed that the secondary waves 
do not cut the boundary of the medium. To deal with such cases, 
in which reflection or refraction takes place, we have to proceed 
by infinitesimal steps. When a point of the wave lies on a boundary 
of the medium, it becomes the centre of a secondary wave which, 
in the case of reflection, has a sense of propagation back into the 
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medium, and, in the case of refraction, goes on into the second 
medium, but with a different velocity. It is evident that the con- 
struction of Huyghens leads to definite reflected and refracted 
waves. These will be discussed below. 

First, however, let us consider the propagation of a wave in a 
single medium, without reflection. We shall estabhsh the fol- 
lowing facts concerning propagation according to the construc- 
tion of Huyghens. Given a wave Z' at time t', the resultant wave S at 
time t is the same whether developed by the constribction of Huyghens 
in one step or in several steps : also, the ncyrmals to Z' are normal to Z, 
and if A' A is one of these normals, with A' on Z' and A on Z, then 
A is the point of contact with Z of the secondary wave having its 
centre at A'. 

Fig. 6 shows the wave Z reached in four steps from Z ' ; Z^, Z^, Z^ 



Kig. 6 


are the intermediate waves. A' is the centre of a secondary wave 
which touches Z^ at A^. B' is adjacent to A' on Z' and its second- 
ary wave touches Z^ at It is implied in the construction that 

(4-1) A'A^ = B'B^. 

Since the spheres touch their envelope, we have, to the first order, 

(4-2) B'B^ = B'Ai\ 

hence by (4-1) 

(4-3) A'A^ = B'A^, 

which shows that A' A normal to Z'. It is of course also normal 
to Zi since, as radius of the sphere with centre Jl', it is normal to 
the tangent plane to that sphere at A^, and this plane is also the 
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tangent plane of Hi. Hence A'Ai is normal to both the waves 
which it connects, and its length is v{ti — t'), if t', ti are the times 
for H', Hi respectively. Continuing the construction, we see that 
A 1 A 2 , being normal to Hi, lies in the same straight Une with ; 

it is of length v{t^ — ti), where is the time for H^. Hence we see 
that, by the application of the four steps shown, the secondary 
waves with centres A', Ai, A^, A^ lead finally to a contact A with 
the final surface H such that A lies on the normal to H' aA, A' at 
a distance 

(4-4) A'A = v(ti - 1') + v (<2 - <i) + v{t^ - h) + v{t - < 3 ) = v{t - 1'), 

where t is the time for H. Since .< 43 ^ is normal to H, so also is 
A' A. But it is now evident that if only one step for the time 



z 

Fig. 7 

interval t — t' were employed, we would get a wave such that its 
point of contact with the secondary wave with centre A' would 
lie on the normal to H' at A' at a distance v(t — t'), i.e. precisely at 
the point A. The result italicized above is therefore established. 

Let us now consider reflection or refraction according to the 
construction of Huyghens. Since the treatments for reflection 
and refraction are almost identical, it will suffice to consider 
refraction. 

Light travels from a medium M’ into a medium M across a 
surface S (Fig. 1). H’ and H represent positions of a wave at times 
t’ and t respectively; P and Q are any two adjacent points on 
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S; A'P, B'Q are normals to the family of incident waves, and 
PA, QB normals to the family of refracted waves, A', B' lying 
on Z' and A, B on Z. Now we know from the construction of 
Huyghens that the times taken to traverse B'QB and A' PA are 
the same, or in terms of optical lengths 

(4-5) [B'QB] = [A' PA]. 

But from the normal property, we have to the first order 

(4-6) [B'Q] = [A'Q], [QB] = [QA], 

and hence 

(4-7) [A'QA] = [A' PA]-. 

in fact the optical length measured along the normals to the 
waves from A' to A has a stationary value. Hence {2-17) may be 
established as the law of refraction for wave-normals by applying 
the stationary condition as in § 2. 

Seeing that reflection may be treated in the same way, we may 
state the following result, which reconciles the ray theory and 
the wave theory in geometrical optics, as far as ordinary media 
are concerned. Given a wave Z' in a medinm M' , there is deter- 
mined by the construction of Huyghens a system of waves Z after 
reflection or refraction. Given a system of rays normal to Z' in M', 
there is determined by Fermat's principle a system of rays after 
reflection or refraction. This latter system of rays is normal to the 
waves Z. 

If light starts from a point source A', the waves are spheres 
having A' for centre; the rays arc the radii, normal to the spheres. 
By the result just established, the normality of rays and waves 
is conserved over each reflection and refraction. Thus if we think 
simultaneously of the rays and the surfaces to which they are 
normal, we have in mind at the same time the two theories of 
rays and waves. 



CHAPTER II 


THE CHARACTERISTIC FUNCTIONS FOR IN- 
STRUMENTS FORMED OF ORDINARY MEDIA 

5. The characteristic function V. 

Let us consider an instrument formed of n. -I- 1 media with 
indices of refraction , 

5 /^1> /^^2’ • • *5 /^n— 1’ 

separated by surfaces 8^, 82, 8„ . 

It is simplest to suppose that only refractions take place: if a 
reflection takes place, the medium in which it occurs is counted 
twice over, the same analysis applying. 



Let A'Pj ... be a ray traversing the instrument. A' lying 
in the first medium and A in the last. By Fermat’s principle we 
know that this ray has a stationary optical length when com- 
pared with adjacent unnatural paths joining A' to A. 

Let Oxyz be rectangular axes of coordinates. t Let x', y', z' be 
the coordinates of A' and x, y, z those of A. If these six numbers 
are given, points A', A are determined; hence by Fermat’s 
principle a ray A' A and a corresponding optical length are also 
determined, j: The characteristic function V{x', y', z', x, y, z) of the 

t Wo might employ different axes for the initial and final media, but since it is 
at times necessary to use a single set of coordinates for both media, we shall, to 
avoid confusion, use a single set throughout. 

} It may happen that there is no ray joining A' and A : then V is not defined 
for that pair of points. 


SCO 
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instrument is defined to be the optical length of the ray from the 
point A'{x', y', z') to the point A{x, y, z) : 

(5-1) V(x',y',z’,x,y,z) = [A'P^.^P^A]. 

To distinguish it from the other characteristic functions to 
be defined later, V may be called the point-characteristic. 

Passing from the ray A' A to an adjacent ray . . . Q^B, and 

denoting the coordinates as follows: 

B’: x' + Sx\ y' + 8y', z' + Sz’, 

B: x + Sx, y + 8y, z-{-dz, 

we have, for the increment in V, 

(5-2) dV = Z,8x' + V^,8y' -tK'Sz'-h VJx + V^8y-i- VJz 
= £V^,8x' + lTJx, 

the subscripts denoting partial derivatives. But by Fermat’s 
principle we have 

(5-3) [B'P,...P^B] = [B'Qi... Qn-B], 

to the first order, and hence 

(5-4) 8V=[B'Q,...Q„B]-[A'P,...P„A] 

^[B'P,...P,,B]-[A'P,...P^A] 

= [B'P,]-[A'P,]-i-[P„B]-[P,A] 

= p' 8p' -t-fiSp, 

where 

(5-5) p' = A'P^, p' + 8p' = B'P^-, p = P,^A, p + 8p = P„B. 

Let 

fa', /?', y' — direction cosines of A'P ^ , 

(5-6) , 1’ 

fa, fi, y = direction cosines of .4. 

Then, as in § 2, we see that 

^8p’ = - (a' 8x' + /?' 8y' -^y'8z') = - Pa' 8x', 

= a8x-i-fi8y + y8z = P<x8x, 
and hence, by (5-4), 

(5-8) 8V = —p' Ba' 8x' -i-pBa8x. 

Comparing (5’2) and (5*8), and noting that therefore 
(5-9) 8x' + SV^ 8x = —p' Ecx! 8x' +pEc(, 8x 
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(5-10) 


for arbitrary values of the infinitesimals, f we see that 

tF, =/iot, p; = Z = M'Y- 

It is convenient to introduce the components of the initial and 
final rays, defined by 

f<r'=/*V, T'=p'^’, v’=/iy, 

[(T = /la, T = pfi, V = fiy. 

In terms of them, (5- 10) may be written 

V. = -(t' V. = -t' V. = -v' 

Jx = O'. = 7'. V^^v. 

We note that between the components there exist the identical 
relations 


(5-11) 


( 6 - 12 ) 


(5-13) o-'^ + r'^ + y'^ = = /^2. 

Hence, by (5-10) or {5-12), it follows that the characteristic 
function V satisfies the two partial differential equations J 

(5-14) V%+Vl.+ Vl = Vl+ Vl+Vl = pK 

In dealing with the behaviour of an optical instrument we have 
under consideration primarily the following twelve quantities: 

'x', y', z', coordinates of a point on the initial ray, 

<r', t', y', components of the initial ray, 

(5-15) ^ 

X, y, z, coordinates of a point on the final ray, 

,(r, T, y, components of the final ray. 

These twelve quantities are not all independent on account of 
the two identities (5*13). 

The following questions may be asked : 

(«) Given the coordinates of initial and final points, what are 
the components at them of the ray passing through these 
points ? 

t In certain special cases these six infinitesimals are not independent : this will 
occur when the congruence of rays from a point B', chosen arbitrarily in the 
neighbourhood of A', fails to pass through all points of a three-dimensional region 
containing A, We shall, for simplicity, exclude such cases from consideration. 

X Hamilton’s dynamical theory was very closely related to his optical theory. 
Either of the equations (5*14) will be recognized as the Hamilton- Jacobi equation 
for a particle moving under no forces. 
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(b) Given an initial point and the components of an initial ray 
through it, what are the components of the final ray and 
the coordinates of a point on it ? 

(c) The same as (6), with interchange of the words “ initial” 
and “final”. 

(d) Given an initial point and the components of a final ray, 
what are the coordinates of a final point and the com- 
ponents of the initial ray? 

(e) The same as (d), with interchange of the words “initial” 
and “final”. 

(/) Given the components of initial and final rays, what are 
the coordinates of points on them? Or, in other words, 
where are the rays situated? 

Later methods will show us how to answer (d), (e), (/). For the 
present we remark that if the characteristic function V of an 
instrument is known, the equations (5- 12) immediately supply 
the answer to question (a). The difficulty in the useful application 
of (S' 12) lies in the difficulty of calculating V for an actual 
instrument. 

The function V defined above is a function of six variables, 
namely, the coordinates of initial and final points. It is defined 
by the instrument. If we are merely interested in the final con- 
gruence of rays due to a source at a fixed point x' , y', z', it is no 
longer necessary to emphasize the dependence of V on x', y', z', 
and we may consider it as a function of x, y,z. We then think of 
V{x,y,z) as the characteristic function of the final congruence of 
rays, the components of these rays being, as in (5'12), 

(5-16) 'r = I^, v = V^. 

The characteristic function for a normal congruence of rays in 
a medium of index /i may also be defined in a slightly different, 
but essentially equivalent, manner as follows. Let E (Fig. 9) be 
any normal surface of the congruence and let P{x, y, z) be any 
point. Let MPhe the ray through P, M being on E. Let us define 

(5*17) V* = [MP] = (iMP-, 



THE CHARACTERISTIC FUNCTION V 


21 


V* is a function of x, y, z. If we displace P to 
Q(x + dx,y + 8y, z + Sz), 

we have 
(5-18) 

SV* = 2:V*Sx = [NQ]-[MP] = /I(NQ-MF) = /iSMP. 



Fig. 9 


Since the projection of on JfP is (to the first order) equal 
to NQ and since MN is perpendicular to MP, SMP is equal to 
the projection of PQ on MP, and so 

(6-19) 8MP = 2:adx, 

where a, /?, y are the direction cosines of MP. Hence, the com- 
ponents being defined as before by (5-11), we have 

(S' 20) 8V* = fi Ea 8x = Ecr 8x, 

which leads us at once to (S' 16), with V replaced by V*. It is 
easily seen that when the final congruence of rays comes originally 
from a point source, the V* as just defined differs only by a 
constant from the V discussed earlier. 

It is possible to design a mirror to reflect to an assigned point 
A all rays of a normal congruence. For if E (Fig. 9) is a normal 
surface to the congruence, such a mirror is given by the locus of a 
point P such that mP + PA = const. 

The proof follows at once from Fermat’s principle. Similarly, we 
can design a refracting surface of material of any assigned index 
to bring a given normal congruence after refraction to an assigned 
point A in the material. These mirrors and refracting surfaces 
may be called /oca? reflectors and refractors. 
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Similarly, a mirror or refracting surface may be found to turn 
any normal congruence into a parallel congruence in an arbi- 
trarily assigned direction. 

Let us return to the general point of view, according to which 
V is regarded as a function of six variables. According to the 
definition it would appear that x', y', z' may be the coordinates of 
any point in the initial medium and x, y, z the coordinates of any 
point in the final medium. Actually, however, these points 
cannot range right through their respective media, because it is 
implied in the definition that it is possible for a ray to pass from 
the one to the other, and this will not in general be the case for 
all pairs of points. Thus in general the ranges of x' , y', z’ and of 
X, y, z are only parts of the initial and final media respectively. 




It is, however, possible and convenient to “continue” the 
function F for values of x' ,y' ,z' , x, y, z corresponding to points 
which do not lie in the initial or final media, but lie on initial or 
final rays produced. Thus, in Fig. 10, A'{x' ,y' ,z') lies on the 
initial ray produced and A{x,y,z) on the final ray produced 
backwards. We may proceed from A' to A by first going along 
A'B', the production of the initial ray, then through the instru- 
ment from B' to B, and then along BA, the production of the 
final ray. We define the optical length of this route from A' to 
A as 

(5-21) [A'B'] + [B'B] + [BAl 

where {A'B'I is an optical length, calculated as if the index were 
fi' , that of the initial medium, and counted negative because 
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described in a negative sense, and \_BA\ similarly calculated as if 
the index were /i, that of the final medium, and also counted 
negative. We then define V{x' ,y' ,z' ,x,y,z) as the optical length 
(5*21) computed in this way. It is easily seen that Fermat’s 
principle holds for optical lengths interpreted as above, and 
furthermore that the fundamental relations (5’ 12) also hold for 
the function V continued in this way. 

As remarked above, the utility of the function V is restricted, 
owing to the difficulty of calculating it. For such a simple instru- 
ment as a plane mirror, however, we can write it down from 
elementary considerations. If the mirror is z = 0, we have 

(6-22) V = V(*' - xf + {y' - yf + {z' + zf. 

For a set of three mirrors at right angles to one another, coincident 
with the planes a; = 0, y = 0, z = 0, we have 

(5-23) V = 4{x' -h xf+\y' + yf+ (z' -f- z)^. 

These simple results may be deduced by the elementary 
method of images. To calculate V for a general instrument, as in 
Fig. 8, we proceed as follows. Let a;,., y^, z^ be the running co- 
ordinates of a point on the surface aSj, and let the equation of 
this surface be 

(5-24) fi{Xi,yi,Zi) = 0. 

Let us draw any path of straight segments from A'{x', y' , z') to 
A (a;, y, z) : if Pj, P 2 > • • • > -P/i are the points where this path meets the 
surfaces, its optical length is 

(5-25) L = ii'A'Pi+ ^ 

i=\ 

This can be expressed easily as a function of 

(6-26) x',y',z',x,y,z,Xi,yi,Zi 1, 2, ...,%). 

By Fermat’s principle we know that L has a stationary value for 
the natural ray for small variations of P^, . . . , P„ on their respective 
surfaces. Thus 


(6-27) 


i(-- 


. dL - dL 
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Consequently 


(5-29) 


dx] dy, '"‘Sy/ 


(i= 1,2,. ..,%), 


(i=l,2, ...,n). 


dL ^ a/, 


the A’s being undetermined multipliers. In (5*24) and (5'29) we 
have 4 to equations for the An quantities a;^, yi, Zj-, Aj- (i= 1, 2, ...,n), 
and if these quantities are found, and the values of Xi, y^, Zi sub- 
stituted in (5-25), we have the characteristic function 
(5-30) V(x', y', z', X, y, z) = L. 

Although theoretically simple, the solutions or eliminations 
demanded by the method usually prove very difficult; the func- 
tions W and T to be defined later are easier to calculate as a rule. 


6. The characteristic function W. 

Consider a ray passing through an instrument. Let A' be a 
point on the initial ray and N the foot of the perpendicular 
dropped from the origin 0 on the final ray (Fig. ll)t. We define 
Why 

(6-1) W = [A'W]. 



If A' is assigned as a source of light, there will in general be final 

I DifiFerent axes may be used for the initial and final media, but for simplicity 
we shall employ a single set of axes for both media. 
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rays with all directions in a certain range of directions, and at 
most a finite number of final rays with an assigned direction. 
Thus if x', y', z' are the coordinates of A' and <r, t, v the com- 
ponents of the final ray (connected of course by the identity 

(6'2) cr2 -t- -f = ^2^ 

so that V is determined by cr, t to within an ambiguous sign), we 
may say that IT is a function (possibly multiple-valued) of the 
variables x' , y', z', cr, r, or 

(6-3) W=W{x',y',z',cr,T). 

W is to be regarded as a second characteristic function of the 
instrument. It may be called the mixed characteristic. 

It is evident that the continuation process described in §5 
enables us to take initial points x', y', z' which do not lie in the 
initial medium, but on the initial rays produced, and to employ 
the above definition for W even though the perpendicular OiV 
falls not on the final ray but on the final ray produced backwards. 

We shall now show the connection between V and W. Let 
A{x, y, z) be any point on the final ray. Then it is easily seen by 
orthogonal projection of OA on the final ray that 

(6-4) [^- 4 ] = [JiSax = Ecrx. 

Hence 


(6-5) V(x', y’, z’, X, y, z) = [A' A] = -f [NA] 

= W(x',y',z', cr, t) + {a-x + ry + vz). 

Let us now give arbitrary variations to A' and A : this causes 
variations in the components of the final ray. Differentiating 
(6-5), we have 

( 6 ’ 6 ) HVj^Sx' + HV^Sx 

= r W;, Sx' + W^Sa + W, St + i:<rSx+ Zx Scr. 

By (5‘12) we have 

(6-7) ZV^ Sx' = - Zo-' Sx', rF, Sx = Za- Sx, 


and by (6-2) 
( 6 - 8 ) 


. ctSct + tSt 

Sv = — 


V 
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Thus (6*6) becomes 

(6-9) - So-' Sx' = EW^. Sx' + WJ(r+WjT 

+ {x — zcr/v) 8(r+(y — ZTjv) St. 

But Sx', 8y', 8z', 8(r, 8t are arbitrary and independent. f Hence 

( 6 - 10 ) 

(6*11) x — zcrlv = —W^, y — zTjv = —W,.. 

We are now in a position to answer question (d) raised in § 5, 
if we suppose the function W known. { For given an initial point 
x', y', z' and the components of a final ray cr, t, v, we know the 
values of the partial derivatives of W; hence (6’ 10) give us the 
components of the initial ray and (6-11) establish connections 
between the coordinates of any point on the final ray. In fact, 
(6-11) are the equations of the final rays. In particular, if there is a 
source of light at x',y' ,z', the congruence of final rays is given by 
(6*11), (T, T, V taking arbitrary values subject to (6-2). 

Let us now consider how W is to be calculated. We shall, 
however, confine ourselves to the case of an instrument involving 
only one reflection or refraction, because the extension of the 
method to the case of a general instrument simply requires the 
combination of the reasoning now to be given with that already 
given for F in § 5. 



In Fig. 12 A'{x', y' , z') is a given point: QN is a directed line in 

t In certain special cases these five infinitesimals are not independent: this will 
occur when the congruence of rays from a point B', chosen arbitrarily in the 
neighbourhood of A', fails to give final rays having all directions adjacent to the 
direction a, t. We shall, for simplicity, exclude such cases from consideration. 

J The answering of (e) merely involves the interchange of initial and final media. 
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the final medium with assigned components cr, t, y, but A' QN is 
not a natural ray in general, the law of refraction or reflection not 
being satisfied at Q. We can, however, express as a function of 
x', y', z' , x^, yy, Zy (the coordinates of Q) and cr, t , v the optical 
length \A'QN'\, where N is the foot of the perpendicular from the 
origin 0 on QN. It is in fact 


(6-12) L = [A'QN] = n'{S{x'-Xyf)^-S(TXy. 

Now let A’ PM be a natural ray, the components of PM being 
cr, T, V and M being the foot of the perpendicular from 0. The 
plane OMN is perpendicular to the common direction of PM and 
QN, and hence if PQ is infinitesimal, MN is an infinitesimal 
displacement on the normal surface through M to the final rays 
of the congruence from a source sA> A' . Therefore to the first order 

(6-13) [A'QN^ = [A'QM] = [A' PM], 


Thus, letting Q tend to coincidence with P, we see that the co- 
ordinates Xy, yy, Zy at P Rre such that for an arbitrary displace- 
ment dxy, 8yy, 8zy On the surface S we have 8L = 0. Therefore if 
the equation of 8 is 
(6-14) = 0, 

we have for the natural ray 


(6-15) 


_ 0/ 

d^y ~ ^dXy’ 


0L _ ^ 

where A is undetermined. Explicitly we have 


dL_ df 

dZy~^dZy’ 


(6-16) 


where 


{ /l'{x'-Xy) 0/ 

P' dXy’ 

T = 

p ^yi 

— u = A _ - , 

P 9^1 


(6-17) p'^=-Z{x'-Xyf. 

If we eliminate A, solve for Xy, yy, Zy from (6-14) and (6- 16), and 
substitute in (6*12), we have the function W, 
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(6-18) W(x',^',z',o-,T)r^L, 

( 6 - 2 ) being used to express v in terms of cr, t. 

The calculation of IT is generally difficult, but not as difficult 
as that of V. Let us calculate W for refraction through a plane. 
Let the plane z = 0 be the plane of refraction, separating the 
initial medium z < 0 of index /i' from the final medium z > 0 of 
index (Fig. 13). 



Since Zj = 0, the optical length L of {6-12) is 
(6-19) L = n'{{x' - Xif + iy' - + z'^ji - (rx^ - 

and this is to be made stationary for arbitrary variations of 
Vi, so that in consequence 


( 6 - 20 ) 

Thus 

( 6 - 21 ) 


{/I'tx' —xf) ^ —y-i) 

; — L' + (r = 0, < 1/ + T = 0, 

\ P P 

[p' = {(x'-a;i)2 + (y'-yi)2 + z'2}*. 


0-2 + T2 = 

p' = - 


2 _ 


V2 


,2 _ _ ^ 




2_n-2 


cr^-T^ 


ll'z' 


(p'2-<r2-T2)i’ 


the negative sign being taken in the last expression because 
p' > 0, z' < 0, 
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Then 

(6-22) Xi = x' + crp'l^', y^==y'+ Tp'j/i’, 

axi + ry^ — <tx' + ry' + (tr^ + t^) p'(p', 

and so 


(6-23) W = L = p'p' — ax^ — Ty^ 


p'H' 


(^'2_o-2_T2)i 


, , z'(0-2 + t2) 

-tra; -ry ^ 


(i“ 


(7“ 


7-2)i 


= — ax' — ry' — — a^ — t^)*. 


This is the characteristic function W for refraction across the plane 
z = 0. 

The equations (G-ll) give as the equations for the final rays 


(6-24) 


O' 

X — Z- = X 
V 


y-z~ = y 


z a 


(^'2_or2_T2)i> 

z't 


iji'^-a^-T^r 

since — a^ — these equations may be written 

z z' 

-T^)i 


/n.25) 

(«25) ^ ^ 


■a^-r^Y (p'^-a'^-T^)^' 

If A\x' ,y' ,z') is a point source of light, we observe that a final 
ray with components cr, r cuts the normal from to the re- 
fracting plane at 


(6-26) 


x = x , y = y , z = z 


the value of z may also be written 


(/* 


'2. 


.T2)i’ 


(6-27) 


z = z 


/*y 

'^|p’^-p^^~y^y 


a value easily checked by an elementary argument based on the 
law of refraction (2-18). 


7. The characteristic function T. 

Let N', N hQ the feet of the perpendiculars dropped from the 
origin O on the initial and final portions of a ray passing through 
an instrument (Fig. 14). f We define T by 

(7-1) T = [N'N], 

t As for V and IF, we might employ two systems of coordinates, but we shall not 
do so. 
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We shall confine our attention to those instruments for which the 
directions of the initial and final portions of a ray define the ray 
completely, or at most define a finite number of rays. This will 
not be the case, for example, if the instrument is a cyUndrical 
mirror, because then a set of parallel rays incident along a 



generator give rise to a set of parallel final rays, and so the 
initial and final directions do not define a complete ray. The same 
applies to the case of refraction at a cylindrical surface, and, more 
generally, to reflection or refraction at a developable surface. 
Such instruments will not be considered. 

Since by hypothesis the components <r', t', v', tr, t, v determine 
a complete ray, it is evident that T is a function of cr' , t', a, t, 
since v', v are given in terms of these four quantities by the 
identities (5*13). Thus we may write 

(7-2) T = T((t',t',(t,t). 

T is to be regarded as a third characteristic function of the in- 
strument; it may be called the angle-characteristic. The con- 
tinuation process employed for V and W is available, and it is 



THE CHARACTERISTIC FTTNCTIOH T 31 

not necessary that N' and N should lie respectively on initial 
and final rays: they may lie on these rays produced. 

We shall now show the connections between T and F and W. 
If A'(x', y' , z') and A{x, y, z) are any points on the initial and final 
rays respectively, we have 

(7-3) V = [A' A] = [A'N'] + [N'N] + [NA], 
or 

(7-4) V(x\y',z',x,y,z) 

= T{a-', t', (T, t) - [ax' + r'y' + v'z') + (orx + Ty + vz), 

or, by (6-5), 

(7-5) W{x', y', z', tr, r) = T{(r', t', <t, t) - (ar'x' + r'y' + v'z'). 

Let us now give arbitrary variations to A' and A, with con- 
sequent variations in the components of the initial and final 
rays. Differentiation of (7*4) gives 

(7-6) SV^.8x' + EVJx = 8<t' + T,. 8r' + TJ(r + 2\ St 

— Z<r' Sx' — Hx' 8a' + Ua Sx + 2Jx 8a, 

the subscripts as usual denoting partial differentiation; hence, 
by (5-12) and (6-8), 

(7-7) -x' + z'a'/v') 8a' {T,. -y' + z't'/v') St' 

+ {Tg. + x — zalv)8a+{T^ + y — zTlu)8T = 0 . 
But the four differentials occurring here may be regarded as 
arbitrary and independent; therefore 

(7-8) x'-z'a'Iv' = T^, y' -z't'/v' = T^., 

(7-9) x — zalv — -T^, y-zTjv = —T^. 

The function T being supposed known, (7-8) are the equations of 
initial rays and (7-9) the equations affinal rays. Thus a knowledge 
of T provides us with an answer to the question (/) raised in § 5. 

Let us now see how the function T is to be calculated for a 
given instrument, starting with a simple instrument in which 
only one refiection or refraction at a surface 8 is involved. 

Let a' , t' , a, t, the initial and final components, be assigned, 
and let N'QN be a broken line, N'Q having components a', t' 
and QN components a, t and N', N being the feet of perpen- 
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diculars dropped from the origin 0 (Fig. 15). If a;, y, z are the 
coordinates of Q, we have for the optical length 

(7-10) [N'QN] = Z<r'x — Ea'x. 



Now let M'PM be the natural ray with the assigned components, 
M', M being the feet of perpendiculars from O, and let Q be ad- 
jacent to P. It is easily seen, as in § 6, from Fermat’s principle, 
that to the first order 

(7-11) [N’QN^ = [M'PM]. 

In fact, for arbitrary variations of Q on the surface 8, 
has a stationary value for the natural ray. Thus, by (7- 10), 

(7-12) E(<r' -(t)8x == 0 

for all variations 8x, 8y, 8z on 8. Consequently, the vector with 
components 

(7-13) <r-cr', t-t', v-u' 

is normal to 8 at the point of incidence, as indeed we already 
knew from (2*13) for reflection and from (2*17) for refraction. 
The mode of evaluation of T depends on the analytical form 
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in which it is convenient to represent the surface. We have, 
by (7-10), 

(7-14) T{cr',T',cr,T) = (tr' - tr) x + (t' - t) «/ + (y' - v) 2 , 

from which x, y, z are to be eliminated by the stationary property 
(7-12). 

If the surface 8 is given in the form 
(7-15) F{x,y,z) = 0, 

then (7*12) tells us that 

(7-16) (r-(r' = XF^, t-t' = XF^,, v-v' = XF„ 

where A is undetermined, and the subscripts indicate partial 
derivatives. Our procedure then is to eliminate A, x, y, z from the 
five equations (7-14), (7-15), (7-16): we are also to use (5-13) to 
eliminate v' and v. 

If the equation of 8 is given in the form 
(7-17) z=f{x,y), 

we know that the direction cosines of the normal have the ratios 


( 7 - 18 ) 

where fy are partial derivatives. Hence 
cr — cr' 
v — v' 


(7-19) 


-fx 


v~v' 


If these two equations are solved for x, y, and z then found from 
(7’17), we may substitute in (7-14), and so by means of (5-13) 
obtain T as a function of the required arguments a-', t', <t, t. 

There is yet a third method, dependent on a knowledge of the 
tangential equation of the surface 8. Let Z, m, n be the direction 
cosines of the normal to 8 at the point of incidence, and let p be 
the perpendicular distance from the origin 0 to the tangent plane 
to 8 at tills point. The tangential equation of 8 is then of the 
form 


(7-20) p = 0{l,m,ny, 

since l^ + m^ + n^ = 1 this may be written in the form 


(7-21) 



SGO 


3 
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But 

(7*22) (T — cr' = Xl, T — t' = Ato, v — v' = Xn, 
where A is undetermined. Then (7*14) gives 
(7*23) T = — X(lx + my + nz) = — Ajp. 

But from (7*21) and (7*22) 


(7*24) 




T — 
V — 



A^ = (<T — (r')2+(T — t')^ + (i; — 


Therefore 

(7*25) T = ±[(a-<Ty + {r-r'f + {v-v'n<l>{^-^ 


T — r 
v — v 


')■ 


from which v', v are to be eliminated by (5*13). The explicit 
ambiguity in sign and those implicit in v' and v are to be 
removed by inspection in any particular case. 

Let us now consider the calculation of T for a general instru- 
ment formed of any number of media. We shall use the notation 
of Pig. 8, § 5, and put 

cr', t', v' = components of initial ray, 

O’, r, V = components of final ray, 
a■^, T^, Vi = components of ray in medium Mi of index Hi 


(i= 1, 2, 1). 


Any two consecutive media may be regarded as forming an 
instrument. Let be the characteristic function for the 

instrument formed by the media Mi. Now a ray traversing 
Mi may be regarded either as a final ray for the combination 
Mi_i, Mi or as an initial ray for the combination AQ, If 

Xi, yi, Zi are the coordinates of any point on a ray in Mi, we have 
then, by (7*8), (7*9), 


(7*26) 


Xf 2, - — Ti_i i, yi Zi ^ Ti_i^i 


Ti 3 


' V. 


r — 2 — = ^ T 

•*'i ■‘■i.i+V 

'^i 


d(Ti 
9cr,. 


dti 


'’’i 3 _ 
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Hence by subtraction 

(7*27) + = 


^{Ti-i,i + Ti,i+\) = 0 , 


these equations holding for i = 1,2, 1, the subscripts 0 

and n being attached to the initial and final media respectively. 

Consider now a ray traversing the complete instrument. From 
its definition as an optical length, it follows that T for the whole 
instrument is the sum of these functions for the simple instru- 
ments formed from pairs of consecutive media, that is, 

(7-28) ^ = ^0,l + ^l,2+ ••• + 

Now in the functions on the right there are involved all the 
components tr^, for the rays in all the media. But any particular 



pair (Ti, Ti enter only into < and Hence it follows from 

(7*27) that T as given by (7‘28) has a stationary value with respect 
to arbitrary variations of the intermediate components. Thus we 


3-2 
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have the following rule for the calculation of T: Find the cha- 
racteristic Junction for each pair of adjacent media and add to- 
gether the results. Eliminate the intermediate components by means 
of the equations which express the fact that the sum so obtained has 
a stationary value with respect to arbitrary variations of the inter- 
mediate components. 

The jT-function of a system will change when the axes of co- 
ordinates are changed. Let Oxyz, Oxyz be two sets of parallel 
axes, the point 0 having coordinates a, b, c relative to Oxyz. 
Denoting by T, T the functions for these two systems of axes, 
we have (Fig. 16) 

(7-29) T = [N'Nl T = [N'NI 

and hence, since N'N', NN are the projections of 00 on the 
initial and final rays, 

(7-30) T = T-\-a{^ar — (x') + b{T — T') + c{v-v'). 

8. The T -function for reflection or refraction at a sphere 
or a paraboloid of revolution. 

The method of calculating T by means of the tangential 
equation (7*21) is convenient when the reflecting or refracting 
surface is spherical. Let us take the origin at the centre of the 
sphere: then p = E, where R is the radius of the sphere, and 
hence the function <j) of (7-21) is simply a constant, 

( 8 - 1 ) = 

\n nj 

Hence by (7-25) the T-function for a sphere is 

(8-2) T = ±Rl{ar-(Ty + {j-ry + {v-v'f-f‘ 

= — 2((ro-'-4-TT' + yi;')]i, 

from which the ambiguous sign is to be removed by special con- 
siderations. Should we wish to remove the origin from the centre 
of the sphere, we may use (7-30). The formula (8-2) apphes both 
to reflection and to refraction. 

To show how all ambiguities of sign are to be removed, let us 
consider internal reflection (Fig. 17 a) and external reflection 
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(Fig. 176). By consideration of the optical path, it is clear that 
T is positive in the former case and negative in the latter. This 
determines the sign in (8-2). Moreover, the axes being as shown, 
we have (the positive square root being indicated in each case) 



Fig. 17 o Fig. 176 


(8-3) Fig. 17a: 

0>V’ = -(;i'2-(r'2-T'2)l = 

0<u = — — — 

(8-4) Fig. 176: 

0<i;' = (/2-(r'2-T'2)i = /t'(l-a'2-/?'2)5, 

0>i; = — — — p{\ — a? — 

where, of course, p' = p, this being the refractive index for the 
medium in which the rays lie. Thus, with all ambiguities removed, 
we have 

(8-5) Fig. 17o: 

T = J?[2/i2 _ 2{(T(T' + tt' + vv')]i 
= E — <x(t' — tt' + {p^ — (T* — T^)* {p^ — cr'^ — t'*)1]* 

= pE V2[l - aa' - + ( 1 - a* - 
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(8-6) Fig. 176: 

T = — E[2/i^ — 2((r(r' + tt' + w')]* 

= —B Vsiyi* — cr(r' — tt' + — cr® — t®)* (/t® — < 7 '® — 

= - /ti2 V2[l - oa' - /^/ + ( 1 - a2 - ^)i ( 1 - a'^ - . 

As usual, a', are direction cosines of the incident ray, and a, 
direction cosines of the reflected ray. 

Let us now consider the case where the reflecting or refracting 
surface is a paraboloid of revolution. Let us take the origin at 
the vertex and the 3-axis along the axis of revolution. Then the 
equation of the surface is 

(8-7) ^ = + 


R being the radius of curvature at the vertex. 
By (7-14) we have 


(8-8) T = {tr' -(r)x + {T' -T)y + {v' — v)z, 

where this is to have a stationary value with respect to variations 
of X, y, z on the surface. Thus 


(8-9) + = = 0. 


But by (8*7) 
( 8 - 10 ) 
hence 

( 8 - 11 ) 


dz X dz y 
dx B’ dy B’ 


TjO- -cr „T -T 

x = —R—, , y = —R—, , 

U —V ^ V —V 


z = \R 


(cr' -rrY + {r' -Tf 


(v'-vf 

Substitution in (8*8) gives as the T-function for a paraboloid of 
revolution 

,(< 7 ' — (t'— t )2 


( 8 - 12 ) 


T = -\R' 


V —V 


For a mirror in the form of a paraboloid of revolution, we have 
p' — ji, and hence 


(8-13) 


T = - \iiR 


y'-y 
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Ambiguities of sign are intrinsic here on account of the dual signs 
in the expressions for y\ y in terms of the other direction cosines. 
If the incident rays travel in the negative sense of the z-axis, and 
the reflected rays in the positive sense, we have 
(8-14) / = y = (l-a2-y?2)i, 

and substitution in (8’ 13) gives T jor a paraboloidal mirror free 
from ambiguity as 


(8-16) 


T =\[iR 


( 1 - a '2 - + ( 1 - a 2 - • 


The T-function is most useful for the discussion of final rays 
when the incident rays are parallel to one another. The final rays 
are, as in (7’9), 


(8-16) 


cr 

x — z— — 

V 


■3V, y-z'^=^-T„ 


and in the functions on the right cr', r' are now to be considered 
as constants. 

We may easily verify from (8- 12) the fact that when the in- 
cident rays are all parallel to the axis of a paraboloidal mirror, 
the reflected rays pass through the focus. We have cr' = t' — 0, 
v' = —p, and it is a matter of indifference whether we make these 
substitutions before or after differentiating with respect to cr 
and T. Thus we may put from (8’12) 


(8-17) 




or^ + T^ 

p + v 


= \R{p-v), 




and (8-16) read 


(8-18) x = ^-{z-\R), y = -^{z-^R), 

which equations show that all the reflected rays pass accurately 
through the point (0, 0, \R). 

The above expressions for T are exact: we shall consider in 
Chapter iv approximate forms of T, useful when we have to deal 
with a small bundle of incident rays. 
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THIN BUNDLES OF RAYS 

9. Foci and focal lines. 

Let us consider the final congruence of rays formed by the 
passage of light through an instrument from a source at a point 
x', y', z'. The equations of the final rays are, as in (6-11), 

(9-1) x — z(rlv = —W„, y — ZTlv = — W/, 

x', y', z' are involved in W , but they are to be treated as constants 
in the present work. If the initial rays do not come from a point 
source at finite distance, but are parallel, we use the T-function 
instead of W, its arguments <r', t' being constants. As a matter of 
fact, we are really concerned at present only with the final con- 
gruence of rays. We are indifferent as to its origin. We know that 
any normal congruence can be produced by reflection at a suitable 
mirror of light emanating from an assigned point source. Con- 
sequently, as far as the study of the general properties of a normal 
congruence of rays is concerned, (9-1) are completely general, or, 
indeed, similar equations with T instead of W. 

If, on a given ray R, there is a point P such that some rays, 
making infinitesimal angles with R, cut i? at P to the first order, 
then P is said to be a focus. Two lines are said to cut to the first 
order when their distance apart is an infinitesimal of order higher 
than the first. 

Suppose now that a ray R and an adjacent ray cut to the first 
order at x, y, z. We shall have equations of the form (9'1) 
satisfied by each ray, x, y, z being (to the first order) the same 
for both rays, but the components o', t differing infinitesimally. 
We may in fact differentiate (9-1), putting 

(9-2) Sx = 8y = 8z = 0. 

Hence 

(9-3) -z8(<tIv) = -8W^, -z8{tIv) = -8W„ 
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or, since = - (tr + t 5t) jv, 

(9-4) \{^(^" + ^^)/^^-K.}Scr+{z<rT/v^-W,,}8r = 0, 

^ [{zar/v^ - W„,} da - + { 2(72 + v^)lv^ - W„) 8 t = 0. 

Elimination of 8 a-, 8 t gives 

(9-5) { 2 ( 0-2 + y2)/y3 _ ^2(^2 + y2)/y3 _ J 

-{za-rjv^-W^^Y = 

a quadratic equation for 2 . Since, as will be shown below, the 
roots are real, this equation, with (9-1), determines two foci on the 
given ray. 

To show that the roots are necessarily real, we take special 
axes, the 2 -axis being coincident with the ray R, for which then 
we have 

(9-6) O' = T = 0, V = [i. 

The partial derivatives of W occurring above were to be evaluated 
for the values of a-, t belonging to the ray R: for our special choice 
of axes, these are as in (9*6). Since (9-1) are to be satisfied by 
X — y = a = T = 0, we have 

(9-7) W^ = W, = 0 

for O' = T = 0. 

We are still free to rotate the axes about the ray R. Rotating 
through an angle 6 gives a transformation 

(9'8) X = xcosd +y8ind, y = —xsind+ycosO, z = z, 
and hence, since o', t, v differ from direction cosines only by a 
constant factor, 

(9-9) ^ = O' cos -I- T sin t = — o-sin^-f tcos^, V = v. 

Now W, as an optical length, has a value independent of the 
directions of the axes. Thus 


(9' 10) 


do- 07 


W — + W- 

Wg + MV 


07 



— w 

— •'fga 


do' da- 
do- dr 




/da- dr 

1 ^ 00-07 


df 0o'\ 
00 - 07 / 


+W„ 


dr df 
da- dr 


= (Wgf, - Wf;f) sin ^ cos 0 -I- Wgf (cos2 d - sin2 
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Thus, given the axes x, y arbitrarily in a plane perpendicular 
to R, we have merely to choose the x, y axes so that 

(9-11) tan29 = -,j^^ 

’*00 ’*ff 

in order to make 

(9*12) = 0 

for the ray R. 

With this special choice of axes, for which (9‘7) and (9* 12) are 
satisfied, the equations (9-4) reduce to 

(9-13) (z//i - 8(T = 0, {zj/i - ir„) St = 0, 

and the quadratic (9*5) reduces to 

(9- 14) {zjfi - W^„) {zl/i - Tf„) = 0. 

The roots are therefore real, namely, 

(9-15) z^ = /iW„. 

Avoiding the particular case where Wg.g. — we see that 
(9' 13) has the two solutions 
(9*16) z = Zi, St = 0; z = z^, Sa = 0. 

Thus each ray of a normal congruence possesses two foci, whose 
coordinates are given by (9*5) and (9*1) for general axes, and by 
(9- 16) /or the special axes. 

By (9*1), (9-7) and (9-12), the equations of a general ray 
adjacent to R, referred to the special axes, are to the first order 
(9-17) x — zSo'j/i = —W^^Sor, y — zSTlp = —W,..^ST, 
where Scr, St are the components of the ray and the partial 
derivatives are evaluated for c = t = 0. By (9' 15) these may be 
written 

(9-18) x = {z-Zi)Sa-l/i, y = {z-z^)STjfi. 

All these rays, for arbitrary Scr, St, cut the plane z = in the line 
(9*19) a; = 0, z = z^, 

and the plane z = Zg in the line 

(9-20) y = 0, z = Zg. 

These lines (9*19), (9’20) (see Fig. 18) are called the focal lines. 
We have the following result: All rays adjacent to any ray R cut 
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{to the first order) two focal lines, one through each focus of R, which 
are perpendicular to one another and to R. 

This gives a very simple way of constructing an approximate 
model of a thin bundle of a nor- 
mal congruence when the focal 
lines are known. We simply join 
up all points of the two focal hnes. 

Let us now consider how the 
focal lines are to be found for a 
general system of axes of coor- 
dinates. We are to use (9-4), z 
satisfying (9'6). Let the roots of 
this equation be Zi, z^, and let 
the corresponding foci be Fj, F^. 

Let 8 ^(t, be solutions of (9-4) 
corresponding to z = z^, and 
8 ^t those corresponding to z = Zg. 

Let Zj, mi, % be the direction 
cosines of the focal line at Fi and Zg, those of the focal 

line at F^. It is evident, from consideration of the arrangement 
of the rays shown in Fig. 18 for special axes, that the focal 
line at Fj is perpendicular to the directions with components 
{<x,r,v) and {a' + 8 ia’, t + 8 iT, u + 8ii>). Hence 





(9-21) 
and so 


flicr + miT + niO = 0, 

]^li8iCr + mi8iT + ni8iV = 0, 


(9-22) li:mi:ni = (T8iV-u8iT):(o8icr-cr8iO):((r8iT-T8i(r); 
similarly 

(9*23) l^:m^:n^ = {t8^v-v8^t):{v8^(t-(t8^v):{(t8^t-t8^(x). 
Equations (9' 22) and (9' 23) give the directions of the focal lines gf 
a ray with components a, r, v, the axes being general. The ratios 
dicrjdiT and d^crjd^T are to be found from (9*4), after inserting suc- 
cessively the two values of z satisfying (9*5) .• 8 iV and 8^0 are to be 
found from the identical relation written just above (9'4). We could 
of course obtain the two values of darjdT by solving the quadratic 
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equation for this ratio obtained by eliminating z from (9*4); but 
we would not then know which focal line corresponded to which 
focus. 

The focal properties of a normal congruence of rays may also 
be discussed by considering the congruence as the normals to 
a wave-surface. In general, two adjacent normals to a surface 
do not intersect to the first order: but if they are drawn from 
adjacent points on a line of curvature of the surface they do 
intersect to the first order. The two foci on each ray arise from the 
intersections of the ray with adjacent rays drawn from points 
on the two lines of curvature on the wave-surface. 

A pencil of rays consists of a single infinity of rays: a pencil 
forms a ruled surface, which is developable if the adjacent rays on 
it intersect to the first order. It is evident that we can construct 
from a given normal congruence two singly-infinite sets of 
developable pencils, each developable pencil consisting of the rays 
which cut a wave-surface along a line of curvature. All the rays 
in each developable pencil touch a curve, called a caustic curve, 
which consists of the points of intersection of adjacent rays in the 
pencil; these points are centres of curvature of the wave-surface 
and foci on the rays, in the sense defined above. The totality of 
caustic curves form a caustic surface of two sheets, whose points 
are the centres of curvature of the wave-surface, or foci on the 
rays (two on each ray). All the rays touch the caustic surface, 
and a focal line of a given ray is simply a lino through a focus, 
lying in the tangent plane to the caustic surface there and 
perpendicular to the given ray. 

Let us now discuss the variation in the cross-section of a thin 
bundle of rays. Let us choose the 2 :-axis along a ray of the 
bundle, and use the special axes for which (9*12) holds. The 
boundary of the bundle will be a pencil of rays, which may be 
defined by equations 

(9-24) o- =/(«), r = g{u), 

where/, g are small functions of a parameter u. Now for any ray 
adjacent to the z-axis we have approximately from (9-1) 

(9-25) X - zarl/i = - (rW^^, y - zrj/i = - tW,^, 
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or, by (9-15), 

(9-26) X = (r{z-zi)l/i, y = t{z-z^)Iii, 

where z^, Zg are the foci. The area of the section of the bundle by 
z = const, is ± .4, where 

(9-27) A = \^{xdy-ydx), 

taken round the bounding curve of the section in the sense of u 
increasing. By (9'26) this is 

(9-28) A = 

Between the foci (zj — z){z — z^ is positive, and its greatest value 
is liZi — z^)^, which occurs for z = \{zi + z^). The corresponding 
value of .4 is 

(9-29) I = - 


Therefore the area of any section is given in terms of this 
maximum area by 


(9-30) 


A 




The area of section is thus proportional to the product of the dista.nces 
from the foci. 

Since the arrangement of rays in a thin bundle is determined to 
the first order by one of its rays (called the central ray) and its 
focal lines, it follows that given a ray, incident on a given re- 
flecting or refracting surface, and the focal lines of that ray, we 
should be able to find the focal lines of the reflected or refracted 
ray. This question will now be investigated in a special case, as 
an example of the use of the T-function. 

A thin bundle of rays is reflected or refracted at a surface, the 
plane of incidence being a principal plane of curvature : the positions 
of the foci on the incident central ray are given, and one of its focal 
lines is parallel to the direction of a line of curvature on the reflecting 
or refracting surface at the point of incidence. It is required to find 
the focal lines of the reflected or refracted bundle. 
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Let US treat the case of refraction as the more general. Let us 
choose the origin at the point of incidence of the central ray, Oz 
along the normal to the refracting surface and Oxy in principal 


n. 



directions of curvature. If the radii of curvature corresponding 
to Ox, Oy are R^, respectively, the equation of the surface near 
the origin is approximately 


(9-31) 


x^ y^ 


Let T be the characteristic function for the pair of media. Then, 
as in (7-14), (7-19), 

(9-32) T = {cr' -(t)x + {t' -T) y + {v' -v)z, 

(x' — cr X t' — t y 

and so 

(9-33) T - + 
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This is of course only an approximate form, based on the approxi- 
mate equation (9*31), and therefore valid only for rays incident 
near the origin. We note from (9-32) that tr' — c, t' — t are small: 
we may therefore, to our degree of approximation, substitute an 
approximate value for y' — y in (9-33). 

Let us suppose that the incident bundle has its central ray in 
the plane Oxz\ the refracted bundle then also has its central ray 
in this plane, and we have, accurately for the central rays and 
approximately through the bundles, 

(9’34) y' = fi' cos i', v = fi cos i, 

where /i', /i are the indices of refraction and i', i the angles of 
incidence and refraction respectively (Fig. 19). Accordingly we 
may write (9-33) in the form (accurate to the second order of 
small quantities) 

(9-35) T = ^k{M,{or'-(r)^ + R^{T'-T)% 

k = .J^. 

/tcos»— /t cos^ 

We note the following values for partial derivatives: 


(9-36) 


i;v =kR„ 

= T,,. = = 0 . 


Now the equations of the incident and refracted rays are, as 
in (7-8), (7-9), 

9.37 U'-«V/y' = T^, y’-z'r'lv' = 

[x - zor/v = -T„, y- zrjv = - T^. 

We may regard these as four equations connecting x, y, z, cr, t 
when x', y', z', a' , t' are given. If we regard z as also given, they 
determine x, ?/, <r, t; in fact, they determine the final ray corre- 
sponding to any assigned initial ray. 

Let the foci F[, of the incident bundle be at z = z'^, z = Zg, 
and let the focal line at F^ be perpendicular to the plane of 
incidence (F^ is then called a, primary focus), the focal line at F'^ 
lying in the plane of incidence (Fg is then called a secondary focus). 
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Let US pass from the central ray to any adjacent ray. Differ- 
entiation of (9'37) gives, in view of (9-36), 

8 x' — 8 {z'ar'lv') = kRi{ 8 (r' — 8 ar), 

8 y' -8(z't' I v') = kR^{ 8 r'- 8 T), 

8x—8{z<t/v) — kR^iScr' — 8 (t), 

8y — 8 (zTjv) = kR 2 { 8 T' — 8t). 

After differentiation we may insert the approximate values 

i cr' = u' 8 m i', t ' = 0 , v' = u'cosi', 

. . . 

a = /I sm^, T = 0, V = /I cos i, 

so that, by (6'8) and the corresponding accented equation, 

8 v' = — ^ = — tani'5(r', 8 v = — = — tani^tr, 

V V 

8(cr'/u') = ^—aec^i', — —sec^i. 

Let the points {x', y', z'), (x' + 8 x', y' -I- 8 y' , z' + 8 z') coincide to 
the first order at the primary focus F[, so that we have 

( 9 - 41 ) 8 x' = 8 y' = 8 z' = 0 , 

and consequently 8t' = 0, since the incident ray must pass 
through the focal fine at F'^. Then (9-38) give 

(9-421 ^ ~ = kRi{8(T' - 8<r), O^-kR^ 8 t, 

— 8 {z(rlv) = kRi{ 8 cr' — 8 ar), 8 y — 8{ztIv) = — kR^ 8t. 

Hence 8t = 0, showing that the varied refracted ray lies in the 
plane y = 0, as is indeed evident from symmetry. The refracted 
rays, corresponding to arbitrary 8 a'', will all pass through a point 
{x,y,z) provided that the equations (9-42) can be satisfied with 
8 x = 8 y = 8 z = 0. Thus we have to satisfy 
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these give the focus on elimination of Scr', Scr, by the 
formula 


(9-44) 


[i cos^ i ii* cos^ i' 


it is a primary focus since 8r — 0. 

To get the other focus, let the points 

(aj', y', z'), (x' + 8x', y' + z' + 8z') 

coincide to the first order at the secondary focus F'^, so that 
(9*41) again hold. Since the ray must pass through the focal line 
at F[, we have 

(9-45) 8<r' = 0, 8v' = 0. 

Putting 8x = 8y = 8z = 0 in (9-38) to get a focus, and re- 
membering (9-39), (9'40), we have to satisfy 


(9-46) 


0 = kRi{ — 8cr), — 7 sec i' 8 t' = kR.^{8T' — 8t), 

/* 

j — “ sec^ i 8 (t = kRi{ — 8 cr), — ^ sec i 8 t = kR 2 { 8 r' — 5t): 


these give the focus z^, on elimination of 8t' , 8t, by the 
formula 


(9-47) 


/icosi yw'cosi' _ 1 


It is a secondary focus {F^). 

If p[, p 2 denote the distances of the incident primary and 
secondary foci from O, and p^, p^, the distances of the refracted 
primary and secondary foci from 0, all counted positive when 
measured in the sense of propagation, we have from (9-44), 
(9-47) 


(9-48) 


fi cos^ i p' cos^ i' _p cos i—p’ cos i' 
Pi ~ Pi ~ ^1 

[I ti' _ II COS i — ii' cos i' 

<P2 p2 ^2 


SGO 
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10. Aberration at a focal line. 


The theory of focal lines developed in § 9 is an approximate 
first order theory. Let us now investigate more accurately the 
pattern formed by a bundle of rays on the plane passing through 
the focal fine on a central ray, and perpendicular to the ray. The 
deviation of this pattern from the 
focal line is known as aberration. 

Let us choose our axes so that 
the z-axis is the central ray, one 
focus being at the origin and the 
x-axis being a focal line. Using 
the IT-function to describe the 
system of rays (supposed to 
emanate from a source x', z' 
in an initial medium), it follows 
from (9‘1) and (9-4) that we have 
for the central ray (o' = t = 0) Fig. 20 

(10-1) w,^o, Pf, = o, w;, = o, W„r=o, W,, = al/1, 

where (0 ,0, a) is the other focus. We shall examine the aberration 
near the origin. 

The exact equations of the rays are as in (9-1) 

(10-2) x-zcrjv = -W^, y-ZTjv^-W^. 

Expanding the right-hand sides in power series in cr, t and putting 
2 = 0, we see that a ray with direction cosines a, /?, y cuts the 
plane z = 0 at the point 



(10-3) I* "" -aat + \{A<x‘^+2Bali+Cfi^) + ..., 

where 

(10-4) 

the partial derivatives being evaluated for cr = t = 0. Intro- 
ducing spherical polar angles 6, ^ to specify the direction of the 
ray, we have 

(10‘5) a = sin^ cos0, ^ = sin 0 sin 
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Including terms of the order of 6^, but of no higher order, (10-3) 
give 


( 10 - 6 ) 


X = —ad cos <f) 

+ \d\A cos^ (j> + 2jB cos ^ sin ^ + O' sin^ (j>), 
y — cos^ (j) + 2C cos 0 sin ^4 + i) sin^ ^). 


To this order of approximation, it is sufficient to substitute in 
the expression for y approximate values for cos ^4, sin ^ given by 
the first equation, namely, 


(10-7) = 



Hence 


(10-8) 2a^ — Bx^ ± 2Cx{a^0^ — x^)^ + D(a^0^ — x^). 

If 0 is held fixed, this is the equation of the locus of the points of 
intersection with the plane z = 0 of all rays making a small angle 0 
with the central ray: by varying 0 we get the whole pattern. 

The curve (10-8) with 0 constant is obviously a flat curve near 
the ir-axis. To make the radical real, we must take x so that 

(10-9) —a0^x^a0, 

assuming a positive for simphcity. Thus the curve is bounded by 
the lines x ±a0, which it touches. Since two values of y corre- 
spond to each value of x except a: = 0, + a0, the curve is a figure 
of eight. Two types are shown in Figs. 21 o, b. For x = ± a0, we 
have y = \B0^, and for a; = 0, y = \D0^\ at the latter point the 
slope is + C0ja. The curve cuts the a;-axis if, and only if, 
G^-BD>0. 


4-2 
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To find the illumination produced on z = 0 by all rays adjacent 
to the central ray, we are to superimpose the curves for all small 
values of 0. Let us find the envelope of the curves. Differ- 
entiating (10’8) with respect to d, we get, on division by 2a20, 


Gx 


( ± (^2^2 _ a.2)i + ^ 

and hence 

(10-11) (a2^2_^2)i q: 

Substitution in (10-8) gives for the envelope the parabola 


( 10 - 12 ) 


y = 


BD-C^ 
2a^D 




A mere reversal of the y-axis changes the sign of D, so that we 
may suppose D > 0 without loss of generality. Then the figures of 
eight cut the positive y-axis, as shown in Figs. 21a, 6; Fig. 21a 
shows the case BD—G^>Q and Fig. 216 the case BD—C'^<<). 
In either case the figures of eight lie entirely on one side of the 
parabohc envelope. No rays meet the plane z = 0 on the opposite 
side of the parabolic envelope, which therefore divides the plane 
into two regions — bright and dark — the light in the bright region 
being concentrated near the parabohc envelope. Thus the theory, 
of geometrical optics indicates a sharp separation between light 
and darkness, but in reality the two regions will merge into one 
another with bright and dark diffraction bauds. 


11. Principal foci: aberration at a principal focus. 

A point P on a ray R is said to be a principal foctis if all rays 
which make with R an angle less than a small angle 0 pass through 
P, distances of the order of 0^ being neglected. A ray containing 
a principal focus is called a principal ray. 

Using the function W, as at the beginning of § 9, to describe 
the congruence of rays — the T-function may of course be used 
similarly — the condition for a principal focus at x, y, z is obviously 
that the conditions of intersection (9-4) should be satisfied for 
arbitrary values of S(T, dr. Thus the equations 

(11-1) z{cr^ + v^)jv^ — zar/y® = z{T^ + v^)jv^ = W„ 
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are to be satisfied. These are three equations for z, a, t, since v 
is given by 

( 11 - 2 ) — — 

When these quantities have been found, x and y are given by 


(11-3) x — zcrlv = —W„, y — ZTlv = —Wf. 

Hence we can locate the principal foci (x, y, z) and the principal 
rays (cr, t) through them when we know the function W. We note 
that the components of the principal rays satisfy 


(11-4) 


\v w w 

or 

(TT — 


In general a given congruence of rays will possess a finite number 
of principal foci and principal rays. 

If we take a principal ray for z-axis and the principal focus on 
it for origin, (ll-l)and(ll*3) must be satisfied with a: = ;i/ = z = 0 
Hence we have 


(11-5) = = = = = o 

for cr = T = 0. 

Let us now investigate the pattern formed on the plane through 
a principal focus perpendicular to the principal ray. This plane 
is called a focal plane, and the deviation of the rays from the 
principal focus is known as aberration. 

The exact equations of the rays are (11’3). Let us take the 
special axes of coordinates described above, so that the focal 
plane is z = 0. Developing the right-hand sides of (11’3) in power 
series, we see that the intersection of the ray with direction 
cosines a, fi, y with the focal plane is (on account of (11‘5)) 

\x = \{Aa^+2Baili+Cfi^) + ..., 

^ \y = {{Ba^+2Cafi + Dp^) + ..., 

where A, B, G, D are constants as given in (10*4). Introducing 
the angles 6, (j> as in (10-5), we see that, to the order 0^ inclusive, 
we have 


(11-7) 


2xd~^ = l(A + G) + i(A — G) cos 2^ -f H sin 2(p, 
2yd~^ = \{B + D)+\{B—D) cos 2f>+G sin 2f>. 
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Elimination of §5 from these equations gives an ellipse with centre 
at the point 

( 11 - 8 ) x = i(A + C)0^ = + 

this is the curve traced out in the focal plane by those rays which make 
with the principal ray a small angle d. 

If we change d to d' in (11*7), but hold fixed, the corre- 
sponding point x', y' is such that 


(11-9) 


X y~ 6^’ 


Hence, given one of the ellipses (11‘7), all the others can be 
obtained from it by magnification with respect to the origin: the 
ratio of magnification is positive, and therefore all the points met 
by rays are to be found on the lines obtained by joining the origin 
to the points on the ellipse and producing these lines away from 
the origin. If the origin lies outside the ellipse, these lines are not 
to be produced through the origin, since this would correspond 
to a negative ratio of magnification. 

Let us investigate the envelope of the ellipses (1 1’7) for various 
values of 6. The intersection of consecutive ellipses must satisfy 

\ 2xd{d-^) = [ - (^ - (7) sin 2<l> + 2B cos 2f>] d<f), 
\2yd{0~^) ■= { — {B — D) sin 2^ + 2C cos 2^] d^. 


The envelope is therefore to be found by ehminating 6 and 0 
from (11’7) and 


(IMl) 


X _ (A — C) sin 2f> — 2B cos 2f> 
y {B — D) &\n2(j) — 2C co&2f)’ 


Comparing this expression with the value of xjy given by (11‘7), 
we see that a real envelope exists if, and only if, (p can be found 
to satisfy 

. . (.4 — (7) sin 2^ — 2 jB cos 2^ 

(J3 — D) sin 2^ — 2(7 cos 2^ 

_ A + G + {A — C) cos 2<j> 4- 2B sin 2^ 

B + D + (B — D) cos 2f) -f 2C sin 2(p ' 
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This equation reduces to 

(1 M3) [C{A + C)- B{B + D)] cos 2<p + {AD - BC) sin 20 

= B{B-D)-C{A-C), 


or 


(11*14) cos(20 — a) = 
where 

(11-16) tana 


B{B-D)-G{A-G) 


[{G{A + G)-B{B + D)Y + {AD - BGf]^ ' 
AD-BG 


G{A + G)-B{B + Dy 
The condition, necessary and sufficient, for the existence of a real 
envelope is that the right-hand side of (1 1*14) should not exceed 
unity in absolute value: this is expressed analytically by 
(11-16) 
where 

(11-17) E =={AD-BGy-4:{B^-AG){G^-BD). 


Gase I. E>Q {Goma). 

Since the ellipses are obtained from one another by magnifica- 
tion with respect to the origin (the principal focus), their envelope 
consists of a pair of lines passing through the focus. The equations 



Fig. 22 
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of the lines may be found by substituting in ( 1 !• 1 1) the two values 
of 2^ (in the range 0, 27i) given by (11-14). The ellipses lie only in 
one of the regions encompassed by the Unes, as shown in Fig. 22. 
Thus the illumination on the focal plane is confined to a wedge- 
shaped region, the bounding lines being especially bright. This 
flare of illumination is known as coma, from its resemblance to 
the tail of a comet. 

Case II. E <0 (General illumination). 

In this case (Fig. 23) the ellipses are contained inside one 
another and the optical focus is inside them all ; the whole region 
in the neighbourhood of the focus is illuminated. 



Fig. 23 
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THE INSTRUMENT OF REVOLUTION 


12. Approximate form of T for any reflecting or re- 
fracting surface of revolution. 


Let us take the axis of symmetry of a reflecting or refracting 
surface of revolution for 2 -axis. We assume that the equation of 
the surface may be expanded in the form 


( 12 - 1 ) 


2 = V-f- 


-I- (x^ -F «/2)2 


2E 


- 1 - 


45 


-f ..., 


where v, R and S are constants, R being the radius of curvature 
at the vertex z = v. We shall confine our attention to rays which 
are approximately parallel to the axis of symmetry and which 
meet the surface near the vertex. To the order of approximation 
which we shall employ it will be unnecessary to consider terms 
in the expansion (12*1) beyond those shown. 

To get a paraboloid of revolution we put 


( 12 - 2 ) 1/5 = 0 . 

If the surface is a sphere of radius R, we have accurately 
(12*3) x^ + y‘^ + {z — v— R)^ = R^, 

or, to the above order of approximation, 


(12-4) 


, _ „ 4 - 4 . 

2R SR^ ' 


Thus, if the surface is a sphere, we are to put 
(12-5) 5 = 2R^. 

Let us now calculate T as a function of the initial components 
cr', t' and the final components o*, t, retaining only terms up to 
and including the fourth order in these components, which are 
small since the rays are approximately parallel to the axis. By 
(7*14) we have, accurately, 

(12-6) T = {cr' — a) X + {t' — T)y {v' — v)z, 

where x, y, z is the point where the ray meets the surface. We are 
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to eliminate x, y, z by the condition that T shall have a stationary 
value with respect to variations of x, y, z on (12*1). 

Let us write 

(12'7) A(r — <r — (T’, At = t — t', Av = v — v', 

A indicating in general an increment occurring at the reflection 
or refraction. Then 


( 12 - 8 ) 

and as in (7*19) 
(12-9) 


Thus 

( 12 - 10 ) 


T = —xA(r—yAT — zAv, 
'A(r dz /I 

At _ dz _ /I r^\ 

(r* = x^+y^). 


A(t 11 r^\~^ 



correct to the third order, and to a first approximation 


( 12 - 11 ) 


X = — R 


A<t 

Tv' 


y = -R 


At 

Tv' 


^ 2 _ 

(^y)2 • 

Substituting in (12-10), and the similar equation for y, we get, 
correct to the third order. 


do-/ i23(do-)2 + (dT)2\ 
dy\ S "(dy)2‘ ) 


At I R ^Acrf + {ATf \ 
dy\ 8 (dy)2 ; 


and hence, correct to the fourth order. 


(12-13) 


(4<r)»+(4r)>/ 2iP(J»-)»+(Jr)n 
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Then, by (12*1), we have, correct to the fourth order. 


(12*14) 






3i23(Jo-)2+(-dT)2\ 

2 8 (Avj^ l‘ 


Substitution in (12*8) from (12*12) and (12*14) gives, correct to 
the fourth order. 


( 12 * 15 ) T = 

^ di; \ 2 aS (^y)2 ;• 

This is the approximate form for T for any reflecting or refracting 
surface of revolution. 


z 



Let us now consider the case of a mirror of revohition with the 
equation (12*1), the rays being incident from z = +oo as in 
Fig. 24, so that the direction cosines of the rays satisfy 
(12*16) 0>y' = -(l-a'2-yff'2)*, 0<y = (l-a^-y^z)*. 

We shall suppose fi = 1. Then 


(12*17) 


Acr= Aoc, At — A/i, 

Av= y — y' = 2— + 0^) — 

-|(a'2 + /?'2)-^.(a'2 + y^'2)2, 
~ = J[l + J(a'2+^'2) + J(a2 + ^2)J^ 


1 

(Ji;)2 


= h 
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these expansions being sufficiently accurate for substitution in 
(12"16). Thus we get, correct to the fourth order, 

(12-18) T = -2w + -|7;(a'2 + A'2)[l + i(a'2 + yff'2)] 

+ ^«(a2 +/?2) [ 1 + |(a2 + ^)] + Ji? [(zla)2 + 

X [l + i(a'2 + /?'2) + + ^2) _ 1 I" |(^a)2 + (Zly5)2}J . 

For a paraboloid this reduces to 

(12-19) T = -2t;+’w(a'2 + /^'2)[l + J(a'2 + yff'2)] 

+ iv(a2+y^2)[-i + |(a2 + ^)] 

+ iR [{Aaf+{Am [ 1 + 

and for a sphere 

(12-20), T = -2v + iv(a'2 + yff'2)[l + J(a'2 + /?'2)] 

+ \v{a.^ + /?2) [ 1 + i(a2 + /?2)] + Ji? [(zla)2 + (zl/?)2] 

X [1 + i(a'2 + p) + |(a2 + /?2) _ ^ {( Ja)2 + (A^- 


% 



Let us now consider a refracting surface of revolution with the 
equation (12-1), the rays being incident from z = — oo as in Fig. 25, 
so that 
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( 12 - 21 ) 


0<u' = (/i'2 — (r'2 — 0< y = (/f2_ 0-2 _ 7-2)1^ 

. , /, 10-2 + 72 1 ( 0-2 + t 2 ) 2 \ 

Au=V-V =/<|l-H o o- 

2 8 /I* J 

10-'2 + t '2 1 ( 0-'2 + t ' 2 ) 2 \ 

2 /t '2 8 /■» I 


A/i-iA-~ 8. 

± = J_ + A _± /I ^!+jA 

zly A/I 2 (^/t )2 ’ 

I 1 




3 ’ 


Uzly )2 {A/ir 
then (12-15) gives 

fT^ -t- ( (T^ -4- 

(12-22) T = —vA/i + ^vA — f- — 


+ hR 


(4^)^+^t)_2 

A/i 


^[ 2^/* “ 2 ,S’ ' (A/if J' 

For a sphere this becomes 


(12-23) T = V A fi + \vA ~ ~ - + IvA ^ — 3^- 

(4o-)* + (/lr)» 

+ “ J/T” 


[' 


1 .(72 + 72 1 (z 1(7)2+ (Zl7)2' 


^‘^'^ 2 / 1 /*'^ /* 4 (zl/t )2 


]• 


13. General form of T: method of calculation up to the 
fourth order. 

An instrument of revolution is an instrument with an axis of 
symmetry, such that the instrument is unchanged (optically) 
when rotated through any angle about that axis. The most im- 
portant optical instruments are of this type. 
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Let US take the axis of symmetry for e-axis : as usual we shall let 
/i', a-', t', v', a!, fi', y' 
refer to the initial medium, and 

fi, (T, T, V, a, p, y 

to the final medium. The T-function for the instrument is a 
function of the four quantities cr’, t', a, t: it may therefore be 
expressed as a function of the four quantities 

(13-1) + or'(T + T'T, cr^ + T^, cr, 

because these quantities determine cr’, t', cr, r. The quantities 
( 13" 1 ) in fact determine a ray : if we fix the first three of them and 
allow cr to vary, we get a single infinity, or pencil, of rays. We have 
' o-'2-fT'2= + = /2(l-y'2), 

(13-2) - cr'cr + T'T = n' fi{a'(X + fi) = fi'pL{ciosd — y'y), 

-f -f- fp) — ii^{ 1 — 7^), 

where 6 is the angle between the initial and final rays. Thus if the 
first three quantities in (13-1) are given, y', y and 0 are deter- 
mined, or, in other words, the inclinations of the initial and final 
rays to the axis of the instrument and also the mutual inclination 
of the rays are determined. Now if wc take a natural ray passing 
through the instrument and give it a rigid body rotation about 
the axis of the instrument, it is clear from the symmetry of the 
instrument that the ray so obtained will be a natural ray, satis- 
fying the laws of refiection or refraction. But under this rotation 
y', y and 6 remain constant: hence the first three quantities in 
(13‘1) remain constant: in fact, the pencil of rays obtained by this 
rigid body rotation is precisely the pencil obtained by holding 
these quantities fixed and varying cr. But from its definition as 
an optical length T is the same for all the rays of the pencil: 
hence T is actually a function of the first three quantities only 
in (13"1). Let us put 

(13-3) e' = (r'2 + T'2, e, = (t'cx+t't, e^cr^ + r^-, 
then 
(13-4) 


T = T{e',e„e). 
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The T-function is particularly useful in those cases where the 
initial rays form a parallel system; when the initial rays diverge 
from a point source x' , y', z' at finite distance it may be more 
convenient to use the IF-function. In general IF is a function of 

x', y', z', (T, t: 


by an argument similar to that used above it is easily shown that 
on account of the symmetry of the instrument W is expressible 
as a function of the quantities 

(13'5) z' , x"^-Vy'‘^, x'(T + y'T, cr^ + r^. 


We shall use the T-function in what follows. 

We shall assume that the rays lie close to the axis of symmetry 
and are nearly parallel to it. Thus e', e,, e are small, of the second 
order relative to the inclination of the ray to the axis. We shall 
suppose that T may be expanded as a power series, 

(13-6) T = T<«)+T<2)+T<«) + ..., 

where is a constant and 

[T(2) = PV+P,e, + Pe, 

|p(4) == Q'’e’^^Q^e^^ + Qe^+Q'^e'€, + Q'e'e+Q,e,e, 

where the P’s and Q’s are constants characteristic of the instru- 
ment. They depend on the position of the origin on the axis. 
The superscripts (0), (2), (4) indicate orders of magnitude, the 
inchnation of the ray to the axis of the instrument being the 
fundamental infinitesimal. In what follows we shall not include 
terms of order higher than the fourth in T. 

As an illustration of the notation employed in (13*7), the 
results given in (12-18), (12-19), (12-20), (12-22), (12-23) may be 
exhibited as follows, the surface having the equation 


(13-8) 


z = v + 


x^-k-y^ 


2R 


(X2 + 2/2)2 

4;Sf 


For simplicity, the origin is taken at the vertex in some of the 
formulae given below. To change to a general origin on the axis 
of the instrument, we may refer to § 12, or use (7-30). The direc- 
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tions of the rays are as in Figs. 24 and 25 for reflection and 
refraction respectively. We note that 


[m vacuo: — e' — 2e, +e, 

|m general : {A cr)^ + (/It)^ = e' — 2e, + e. 


General mirror of revolution urith vertex at the origin (w = 0). 

T = \R(e' — 2e, + e) j^l + Je' + |e — - ^ (e' — 26', + e) J , 

P' = P=^IR, P, = -hR, 


Paraboloidal mirror of revolution with vertex at the origin (v — 0). 


(13-11) 


' T = iR(e' - 2e,+e) (1 + le' + Je), 
P' = P = \R, P,=^-\R, 

Q" = Q = -hR, Q„ = o, 

Q: = Q,^-kR, Q' = \R. 


Spherical mirror with vertex at the origin (t; = 0). 


(13-12) 


T = IRie' — 2e, + e) [1 + ^e' + je — j^ie' — 2e, + e)], 
.P' = P = iR, P, = ~hR, 

Q" = Q = ^^R, Q,, = <;>; = = Q'=^R. 


General refracting surface of revolution. 
(13-13) 


{ T = — v{p—n') + \v/x,~^e — \vp'-'e' + J vii-^e^ — \vn'-^e'^ 


P' = 


1 n 

“ 2 » 

I V R r,_^^ R p _ R 

■ 2/ ■^2(/i -/*')’ ~2]i'^2{/i-^'y 



UP TO THE FOURTH ORDER 


Q” = - 




r n-il 1 

'fif' ^ J 8//3’ 


n — ^ r -1 




<?„ = - 


R* 

8{fi-ii'r 

R r 


2 {[i-ii') 


r , , 2/?3^ n 

+ 8 J’ 


^ ^ r 


. 2 ie 3 ^ ,1 


Hr ^2 H‘^ 1 

Spherical refractor. 

(13-14) 

r = — w(/i — /t') + Ji>/i "’e — 

+ 2(/^^ + e) [1 + l(lt-ii')-^ (6/i-i - 

(e'-2e,+e)], 

P'=-^-+ -_ P = - ^ 

2fi'^2{/i-/i'y 2/i'^2{/i~fi'y ’ /i-fi'’ 

^ I{{2jLl—/l') 1 V 

— 8///^’ 

' O - ^ fu-^ -Ha- «M-11 + ^ ^ 

^ /M J + K//, 3 “ 8 /H //,-//.' -> 3 + K//. 3 > 


8 8 /fc(/^ — ii'Y ^ A 


2(/f-/i')3’ 


«- - - m-7f' 


SGO 


5 
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In § 7 we had a general method for the calculation of the T- 
function for any instrument. Two steps were involved: (i) the 
determination of the T-function for each pair of successive media, 
(ii) the elimination of the intermediate components, so as to 
leave T a function of the initial and final components. Now we 
are only interested in the calculation of T up to the fourth order 
for an instrument of revolution. The general process may be 
simplified under these circumstances. 

In the notation of (7*28) the T-function for the complete in- 
strument is 

(13-15) r = 

the intermediate components are to be eliminated by means of 
dT dT 

(13-16) ^ = 0, =0, 

Now T as given in (13-15) may be expanded in the form 
(13-17) T - 

the superscripts indicating orders of magnitude in the small 
components. If we retain only terms up to the fourth order in T, 
the equations (13-16) are of the third degree: we shall see, how- 
ever, that it is sufficient to eliminate the intermediate variables 
by means of linear equations, in accordance with the following 
theorem: If instead of using the exact equations (13-16) we 
eliminate the intermediate components from (13-15) by means of 
the linear equations 

077 ( 2 ) 077 ( 2 ) 

(13-18) -g^- = 0, g-- = 0, {i=l,2,...,n-l), 

the error so introduced in T is of the sixth order, and is therefore 
negligible when T is required only to the fourth order. 

It is simplest to jn-ove this theorem in a more general form. 
Let x^, x^, be a set of small variables (corresponding to the 

intermediate components) and y^, y.^, ...,yn another set of small 
variables (corresponding to the initial and final components). 
Let / (corresponding to T) be a function of the (c’s and y’s, and 
let it be represented by a series of the form 

(13-19) flx,y) =/<®-l-/®(a;,«/)-f-/(«(a;,y)4-..., 
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/W being a constant and /®, polynomials, homogeneous of 
degrees 2 and 4 respectively. We are to eliminate the x’s from/ by 
two different processes. Let us in general write 

(13-20) = 

The first process of elimination is by means of 
(13-21) ff^(x,y)^0. 

Let the solution of these equations be 

(13-22) a:, = Uv)', 

when these are substituted in f{x,y), we get 
(13-23) F{y)=m,y). 

We have of course 
(13-24) 

for arbitrary values of the y’s. 

On the other hand, let us eliminate the a:’s by means of 

(13-25) Mx,y)=^0. 

Let the solution of these equations be 
(13-26) Xr = iriy)+vAy)'> 

when these are substituted in f{x, y) we get 
(13-27) 0{y)^M + y,y). 

We have of course 
(13-28) 

for arbitrary values of the y's. fVe shall now 'prove, that 

G{y)-F{y) 

is of the sixth order. 

The equation (13-28) may be written 

(13-29) f?\$+v,y)+n^\i+y,y) + :- = o, 

or, expanding the first term as a power series in the ^’s, 

m 

(13-30) fi^\i,y)+ lvsf^fity) + -+fi*\^+v,y) + ^- = 0, 

S=1 

where f^f is a partial derivative of the second order. The first 
term vanishes, by (13-24): are constants: the term is of the 


5-2 
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third order. Hence the t/’b are of the third order. We have then, 
writing Og for terms of the sixth order, 

(13-31) 0(y)-F{y) = M + 

+ y, y) y ) + 

in, m m 

= i 2 y)+ 2 y) + 

r=ls=l r=l 

= Oe; 

this estabhshes the result, and hence proves the theorem asso- 
ciated with (13-18). 

The simplification consequent on the use of linear equations 
(13-18) instead of cubic equations is naturally very great. 

14. First order theory: object and image points: cardinal 
points. 

We shall now develop the first order theory of a general instru- 
ment of revolution, the rays being adjacent to the axis of the 
instrument. In this approximation we shall neglect in the 
equations of the rays the squares and higher powers of the 
distances from the axis of the instrument and of the inclinations 
of the rays to the axis. It is therefore only necessary to retain 
terms of the second order in T, 

Taking the z-axis along the axis of the instrument, we have to 
the required order of approximation 

(14-1) T = P'e' +P^e, + Pe + const., 

as in (13-7). To the order of approximation here considered, the 
three constants P', P,, P and the initial and final refractive 
indices determine the optical behaviour of the instrument. We 
shall suppose these constants known, and develop the optical 
properties in terms of them. 

The equations of the initial and final rays are given by (7-8), 
(7-9), in which x', y', x, y, cr', t', cr, t are small. Since 
(14-2) u'2 = — — 
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we have to the first order 


(14-3) 


v' = rj'fi', V = 7i/i, 


where 7)' , y are ± 1, being + 1 if the ray in question is proceeding 
in the positive sense of the 2 -axis and — 1 if it is proceeding in the 
negative sense. When only refractions occur, 7f' and tj have the 
same sign, so that tj't/ = 1. More generally, this condition holds 
when the final rays have the same sense as the initial rays. We 
may then say that the instrument is direct. When the instrument 
is a single mirror, we have tj't) = — 1. More generally this con- 
dition holds when the final rays have a sense opposite to that of 
the initial rays. We may then say that the instrument is reversing. 
By retaining the factors ^ we shall be able to discuss both types 
of instrument at once. 

By (7-8), then, the equations of the initial rays are approxi- 
mately 

( dT 

\x'-7i'z'(r'//i' = = 2P’cr' + P,(r, 


(14-4) 


y' — ri'z'r'j/i' 


dT 

'97' 


2PV + P,t, 


and the equations of the final rays are, by (7-9), approximately 


(14-5) 


dT 

x — Tjzcrjji — ~ ^ — — P,cr' — 2Prr, 


y-rjZTl/i 


dT 

97 


-P,t'-2Pt. 



Fig. 26 
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Let US suppose that there is a point source of light at a point 
r near the axis of the instrument, the coordinates of I' being 
* > y'f (J’ig- 26). In (14-4), (14-5) we have four equations: 
ehmination of cr' , t' will give the equations of the final congruence 
of rays due to the source I'. By (14*4) we have 


(14*6) 


<r = 


x'-Pa- 


■P.T 


T = 


2P' + 7)'z'I/i” 2P' + 7}'z'I/i'' 

Substituting in ( 14' 6) and rearranging, we have 

No matter what values cr and t may have, these equations are 
satisfied by 


(14-7) 


(14-8) 


X = 


P.x' 


2P' + 7}'z'I/i'^ 

P? 


y o b' 


P,y' 


2P' + 7l’z’lfl'’ 




Thus, to the first order, all the rays emanating from an object point 
I'{x', y' ,z') in the initial medium pass {after traversing the instru- 
ment) throvcgh an image point I{x,y,z), with coordinates given by 
(14-8). 

The equations (14* 8) may be written more symmetrically 

nj.m y'P'p' ^z-2vP/i 

^ ^ x' y' z' + 2y'P'ii' yP,p ’ 

(14-10) (z-27}Pij^){z' + 2ri'P'ii') + (7i'P,ii'){riP,p) = 0. 

Since xjy = x'jy', it follows that the object and image points lie 
in a diametral plane, a diametral plane being a plane through the 
axis of the instrument. This fact is of course obvious from the 
symmetry of the system, since the instrument and the congruence 
of initial rays have the diametral plane through the object point 
for a plane of symmetry. 

It is also clear from symmetry that if the object point lies on the 
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axis of the instrument, so also does the image point: this also 
follows from (14‘8), since a;' = y' = 0 imply x = y = Points on 
the axis such that one is the image of the other are called conjugate 
points: the relation between the positions of conjugate points is 
(14-10). The planes through conjugate points perpendicular to 
the axis of the instrument are called conjugate planes. It is 
evident that an object point and its image are situated in con- 
jugate planes. 

We shall now define three pairs of cardinal points on the axis 
of the instrument. These are the focal points, the nodal points 
and the principal points, the planes through them perpendicular 
to the axis of the instrument being the focal, nodal and principal 
planes. 

The focal point F' is defined as the object point whose image 
point is at an infinite distance on the axis of the instrument, and 
the focal point F is defined as the image point whose object point 
is at an infinite distance on the axis of the instrument. Thus the 
z' of F' is to be found by letting 2 -> oo in ( 14- 10) and the 2 of -F by 
letting 2 ' ->oo in the same equation. Hence we have for the focal 
points 

(14-11) z{F') ^-2r}'Pji', z(F) = 2?/P/i. 

Since object and image points are principal foci in the sense of 
§ 11, and in particular so are focal points, we may say that 

the focal point F' is the principal focus in the initial medium 
for rays parallel to the axis in the final medium; 

the focal point F is the principal focus in the final medium for 
rays parallel to the axis in the initial medium. 

The nodal points N' , N are defined as conjugate points such that 
the corresponding rays through them are parallel (in the same or 
opposite senses). If the senses are the same, we are to have 
of = a, P' — P, and if they are opposite, of — —oc, P' = — p. 
Thus the conditions for the two cases are included in 

(14-12) 5/'a' = rja, if P’ = rjP, 

a! , P' and a, P being direction cosines of corresponding rays 
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through N' and N respectively. Substituting in (14-4), (14*5) 
the following values, 


(14-13) 
we have 
(14-14) 


\x' = y' = X = y = 0, 

|cr' = /t'a', r' = fi'P', cr — jj/x, t = nfi, 

[ — y'z'a' = 2P' fi' of + P^fia, 

— Tjza = — P,n' a! — 2P[iol, 


with similar equations with fi', ^ substituted for a' , a. Hence by 
(14-12) we obtain for the positions of the nodal points 


\z{N’) = -2y'P'fi'-yP,fi, 
I z{N) = 2rjP/j, + 'ri'P,/i' . 


Let us now take as object a short line A'£', perpendicular to 
the axis, A' being on the axis. I^et A be the image of and B the 


B' j 

1 

1 < 
1 

1 

B 

1 

A 1 

1 

t 1 

[ A 


Fig. 27a 


B' ^ 
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j 

( 

A 

A' ! 

( 

( 

! 

1 


i 

; 1 

B 


Fig. 276 


image of B'. We know by (14-8) that AB will be parallel to A' B': 
all the points on ^ will have images on AB, and we may speak 
of the line AB as the image of the line A'B'. The line AB may 
have the same sense as A'B' (Fig. 27a) or the opposite sense 
(Fig. 21b). In the former cases we have an erect image, in the 
latter an inverted image. 

We define the magnification (m) to be ±ABjA'B', the + sign 
being taken when the image is erect, the — when inverted, f By 
(14-9) the magnification is given by 

y _ 2 ~ 2rjPii 


(14-16) 


X 

m = - = 

X y 


yP,[i 


The principal points U', U are defined to be conjugate points 

t Or we may say m = ABfA'B', interpreted algebraically. 
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(14-18) 


(14-19) 


for which the magnification is equal to unity. Hence, putting 
m = 1 in (14-16), we have 

\5(C1)=v(2P + P,). 

By (14-5) we have, for any ray through U, 

r-o-(2P+P) = -P(r'-2P<T, 

(14-18) V , 

^ ’ [-t(2P + P,) =-P,t'-2Pt. 

Hence for corresponding rays throv^h the principal points the 
relations cr' = cr, t' = t hold. These eonditions might have been 
used as a definition of principal points. 

Let us put for brevity 

(14-19) f2^yP' = «', v'p'P,=b\ 

2ri/iP = a, fl/iP, — b. 

Then by (14*11), (14-15), (14-17) the positions of the cardinal 
points are as follows: 

(z{F') = -a', z{F)=a, 

(14-20) c(A^') = -a'-h, z{N) = a + b', 

\z{U') = -a'-b', z{U)^a + b. 

Hghcc 

{ z{N')-z{U’) = z{N) -z{U). 
z{N')-z(F') = ziF) -z{U), 
z{N) -z(F) = z(F')-z{U'). 

Using the ordinary notation for directed segments on a line, in 
which a segment is counted i^ositive or negative according as it 
runs in the positive or negative sense, we have 

(14-22) U'N' = UN, F'N' = UF, FN = U'F'. 

A possible arrangement of the cardinal points is shown in Fig. 28. 


(14-20) 


r 



N U 

F 



Fig. 28 
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The following relations are also easily proved: 

(14-23) riiiFN + ri’n'F’W = 0, rjfiF'U' + 'ti'fi'FU = 0. 
Further, if O', O is any pair of conjugate points, it follows from 
(14-10) that 

(14-24) FG.F'C' = FN.F'N' = FU.F'U'. 

The pairs of conjugate points form two homographic ranges on 
the axis of the instrument,! the relation (14-10) being, in the 
notation of (14-19), 

(14-25) (z-a){z’ + a') + bb' = 0. 

The double points, i.e. those points which are their own con- 
jugates, are found by putting z' = z: thus 

(14-26) z^ + z{a' — a) — aa' + bb' = 0, 

z = i{z(F') + z{F)} + ^HFF^ + F'N' .FN. 

Such points necessarily exist if bb' < 0, which is the case if 7)', rj 
have opposite signs, i.e. if the instrument is of the reversing type. 

The first and secondfocal lengths of the instrument are defined as 

(14-27) f = F'U', f=UF. 

Thus, by (14-20), 

(14-28) f' = -ri’n'P„ f=-rifiP„ 

and so the focal lengths are connected by the equation 

(14-29) = rififi. 

We observe that when the initial and final refractive indices are 
the same (ju,' = ju,) and the instrument is of the direct type (i/' = fj), 
we have /' = /. Further, in this case the nodal points coincide 
with the principal points. 

The image of a given object can be constructed very simply 
when the focal and principal points are given. Let A'B' be the 
object (Fig. 29). Through B' draw a parallel to the axis of the 
instrument, cutting the principal planes at V, V respectively. 
The incident ray B'V' emerges as VF. Through B' draw B'F', 
cutting the principal plane through U' at W. Through W draw 
a line parallel to the axis of the instrument, cutting the principal 

t Of. C. V. Durell, Plane Oeometry, Part II (London, 1910), p. 206. 
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plane through U at W. The incident ray £'JP' emerges along the 
line just drawn, and the image of B' is the intersection B of this 
line and VF. 



Fig. 29 


If the nodal points N', N arc given we may use a ray through 
them instead of one of the two rays given above, remembering 
that the final part of the ray through N is parallel to the initial 
part through N'. Thus we may carry out our construction using 
any one of the following sets of points: F'FU'U, F'U'N'N, 
FUN’N. 

As a simple illustration of some of the preceding formulae, we 
may apply them to the case of a mirror of radius of curvature B 
with approximate equation 


(14-30) 

We have by (13-12) 


~2R' 


(14-31) P' = P = IR, P, = -\R, 


and fi,' — fi = 1, ^' = — 1, = 1, the incident rays travelling in 

the negative sense as in Fig. 24, in which now v = 0. We find 
from formulae given above that the focal points coincide at 
z = \R, the nodal points coincide at z = i? (the centre of 
curvature), and the principal points coincide at z = 0 (on the 
mirror). 

Taking (14-30) for the surface of separation of media of refrac- 
tive indices fi' for z < 0 and fi for z > 0, as in Fig. 25 with v = 0, 
we have as in (13-13) for « = 0 


(14-32) 


P' 


= P = 


R 

2(p~p'y 


p. = 


R_ 

H-H'' 
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We are also to put rj' = t] — 1 . We find for the focal points 

(14-33) z{F')^ ^-R, z(F) = -^,R. 

fi—fl /t— /I 

The nodal points coincide at the centre of curvature z = R, and 
the principal points coincide at 2 ; = 0. 


15. Spherical aberration, astigmatism, coma, curvature 
of the image, distortion. 

We have seen that if, for rays adjacent to the axis of an in- 
strument of revolution, we neglect in the equations of the rays 
small quantities of order higlier than the first, then to each object 
point there corresponds an image point. In fact, if the incident 
rays emanate from a point, the final rays pass (to the first order) 
through a point. 

One of the purposes of an optical instrument is to produce a 
point image of a point object. To the first order any instrument of 
revolution does this for monochromatic light. But when terms 
of higher orders are taken into consideration this is no longer the 
case. Furthermore, given a small object-pattern on a plane per- 
pendicular to the axis, a perfect instrument should produce on 
some plane an image-pattern, in which the dimensions of the 
object-pattern are magnified or diminished uniformly. It is 
evident from (1 4* 9) that, to the first order, every instrument of 
revolution is perfect in this respect. But, again, defects appear 
when a more accurate discussion is given. 

Confining our attention to monochromatic light or to an in- 
strument involving reflections only, so that chromatic aberrations 
(§ 19) do not occur, we have to consider defects in an instru- 
ment of revolution. The first three of these, spherical aberration, 
astigmatism and coma, arise from failure to produce a point image. 
The other two, curvature of the image and distortion, concern 
the failure of the instrument to reproduce a plane pattern to 
scale. 

In the approximation now to be given we shall include terms 
of the fourth order in T, but omit terms of the sixth order. Thus 
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(15-1) T = + + + + 

+ Q\e!e^ + Q'e!e 4- Q,G^e, 
e' = o-'^ + r'^ e, = o-V + rV, e = + 

We shall limit our attention to object points at an infinite distance, 
so that the congruence of incident rays from an object point 
consists of parallel rays; u' and r' are then the same for all these 
rays. To deal with objects at finite distance it is more convenient 
to use the characteristic function W . 

We shall assume that the final rays travel in the positive sense, 
so that v>0. We have then approximately 

Since 


(15-3) 
d(r~ dr ~ 


= 2(r, 


- = 2t, 


the exact c(iuations (7-J)) of the final rays, namely, 


(16-4) 


cr 

— x + z— = 

V 


dT 

ao-’ 


-y+z 


T 

V 


dT 

aV’ 


give, to the third order inclusive, 

— a: + 1 + = P,or' + 2P(r + 2Q„e,o-' 

+ 4^60- + Q',e'(r' + 2Q'e’(T + Q,(cr'e + 2e,cr), 

— y-\- ZTii-^{ 1 + = Pj' + 2Pt + 2Q„e,T' 

+ 4QeT + Q',e'r' + 2^'e'T + Q,(T'e + 2e,T). 

The terms omitted are of the fifth order at least. 



Spherical aberration. 

Let there be an object point at an infinite distance on the axis 
of the instrument. The incident rays are then parallel to the axis, 
and we have 


(15-6) o-' = t' = 0, e' = e, = 0. 

Hence by (IS- 5) the final rays have the equations 
7 \-x + Z(rii-\l + \€iJt,-^) = 2P(r + 4Qeor, 

— t/ + 2T/i~^(l + = 2PT + 4^eT. 
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The final ray with components <r, t cuts the plane z = const, in 
the point 

115-8) "" (T[i-\z-^2P[i) + \ireii-'^{z-'^Qn^), 

= T/t-i(z - 2P/t) + \reii-\z - 8^/P). 

In general this point has a first-order distance from the axis, but 
if the plane z — const, is the focal plane z — 2P/< (14-11), this 
distance is of the third order: we have then 
fa; = o-e/i-2(P-4(>/i2), 

\y = Ten-\F-‘^Q[P), 
the distance from the axis being 

(15-10) r = \e^fi~\P — 4LQfP)\. 

The rays (15-7) for arbitrary o', r do not pass through a point. 
There is a principal focus or first-order image at x = y — 0, 
z = 2 P/ 1 , but final rays inclined to the axis of the instrument 
pass by this point at a distance (15- 10) which is of the third order, 
and so not negligible for the present approximation. This devia- 
tion from the point image for incident rays parallel to the axis of 
the instrument is called S2>h,erical aberration. It is obvious from 
(15-9) that it is present unless the instrument is designed to 
satisfy the condition 

(15-11) P = 4cQii^. 

This is the condition for the absence of spherical aberration. 

Let us consider a mirror of revolution, for which we have, as 
in (13-10), 

(16-12) /,= !, P=iB, = 

Then 

1 

(15-13) P-4«/t2 = -^. 

The equation of the mirror is (13-8), with u = 0. The defect of 
spherical aberration will therefore be present in a mirror unless 
S — CO. This condition gives a paraboloidal mirror, for which 
indeed we know from elementary geometrical considerations or 
from (8-18) that rays parallel to the axis are reflected accurately 
through the geometrical focus. 
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In the case of a spherical mirror of radius R, we have as in ( 1 2- 5) 
8 = 2i2®, and hence 

(15-14) P-4Q/t2=xVi2. 

It is evident from (15-9) that for any instrument of revolution 
a final ray inclined to the axis of the instrument at an angle 0 cuts 
that axis at a distance 

H0\P ~ AQp^) 

in front of the focal point, i.e. at 

(15-15) z = 2Pp— — 4Qp^y, 

this also follows from (15-8) on putting x = 0. For the case of a 
spherical mirror this result is easily checked by trigonometry. 

Conditions for the formation of a point image. 

We have already seen that the condition (15-11) is necessary 
and sufficient for the formation of a point image of an object 
point at infinite distance on the axis, to the order of approximation 
considered. We shall now show that i/ (15-11) is satisfied arul also 
(15-16) Q,=^Q„ = 0, 

then any congruence of parallel incident rays at small inclination 
to the axis gives a point image. In this case the object point is at 
infinity, but not on the axis. 

Substituting from (15-16) in (15-5), we obtain 
— X + za'p~\l + = P,cr' + 2Por 

+ 4Qecr+ Q'e'tr' + 2Q'e'(T, 

— y + zrp-\ 1 + — Py + 2Pt 

+ 4^eT + Q’,e'r' + 2Q'e'T, 
as the equations of the final rays. The first of these may be 
written, by (15-11), 

(15-18) x + Py + Q',e'a-' = cr{{l + {zp-*^ — 2P) — 2Q'e'}. 

We are justified, to the order of approximation employed, in 
adding a term of the fifth order, since this is negligible: hence 
(15-18) and its companion may be written 

fl5-19i + + =0'(l + ie/i-2)(z;t-i-2P-2gV), 

^ ’ [y + py + gyr' =T(l + iep-<‘)(zp-i-2P-2Q'e'). 


(15-17) 
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To the order of approximation considered, all these final rays pass 
through the point 


(16-20) 


'x = -Py-Qy(r', 
■2/ = -Py-g:eV, 
z = 2Pfi + 2Q'e'ii. 


This estabhshes the result stated. 

By treating (15- 5) as identities in cr, t, it is easily seen that the 
conditions (15-11), (15-16) are necessary as well as sufficient for 
the formation of a point image. 


Astigmatism. 

Let us now suppose that 

(16-21) P = 4:Qii\ Q, = 0, <?„/0. 


Employing the same device as before, namely, the addition of 
terms of the fifth order, the equations (15-5) for the final rays 
may be written 


(16-22) 


x-\-P,(r' + — cr(l + — 2P — 2Q'e') 

= - ^Q„e,ar', 

y + Pj' + Q',e'r' - t( 1 + (z/*"' -2P- 2Q'k') 

= -2Q„ey. 


Let us choose the axes of x and y so that the plane of xz is i)arallel 
to the incident rays: then 


(15-23) t' = 0, e, = O’er', e' = cr'^, 

and (15-22) become 

'x+py+qy^ 

(15-24) - =(T{l + \eii~^)(zfi~^ — 2P — 2Q'(T'^ — 2Qy^), 

y = 7(1 + ley-^) {zy-^ -2P- 2Q'(r'^). 

Consider the lines 


'x+py+Qy^ = o, I 

z-2y(P+Q’cT'^ + Qy^) = 0,J 

y = 0, I 

z-2y(P+Q'a'^)^0, J 
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No matter what values tr and t may have, we can find z, z to 
satisfy the equations of L^, and therefore the first of (15-24); we 
can then choose y to satisfy the second of (16-24). Thus the ray 
(15-24) cuts Lj. Similarly, it cuts Lg. TAm all the final rays cut 
the two lines L^, L^. These fines are the focal lines of the final 


X 



bundle, but the general theory of focal fines is somewhat com- 
plicated by the approximations here employed. Fig. 30 shows 
these focal fines and their positions relative to the focal plane 
z = 2P/t for the case Q’ > 0, Q„ > 0, P, < 0, a' > 0. The diagram is 
not drawn to scale: actually the distance of from the axis of 
the instrument is of the order of or' , whereas the distances of 
and Pg from the focal plane and from one another are much smaller , 
namely, of the order of cr'^. 

If Q„ = 0, the two focal fines cut, and we get a point image, as 
indeed we know from earlier considerations. 

A ray, as given by (15-24), cuts the focal plane z = 2P/t in the 
point satisfying 

.,r oc^ \^ + P,<r' + Qy^ ^-2a<r'W + Q„h 

\y = -2ra'^Q'. 


Thus all final rays inclined at a small angle 0 to the axis of the 
instrument cut the focal plane in an ellipse, 

(z+py + Q'y^] 


(15-27) 


Q'+Q„ 




SGO 


6 
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its centre is at the point 

(15-28) x = -P,(r'-Qy^, ij = 0, 

and its area is 

(15-29) inye^a-'^Q'iQ' + Q,,). 

The ellipse reduces to a straight line if 
(15-30) Q' + Q^=.0 or Q' = 0, 

in either of which cases one of the focal lines is in the focal plane. 

The aberrational phenomenon present in the above system is 
known as astigmatism. The literal meaning of astigmatism refers 
to the failure of the instrument to form a point image. 

Coma. 

Let us now suppose that 

(15-31) P = g„ = 0. 

The equations (15-5) for the final rays may be written 

X + Py + Q',e'(T' — <r( 14- {zpr^ — 2P — 2Q'e') 

= -Q,{(T'e+2ey, 

■ y + py + QyT’ -T{l + ie/i-^)(z/i-^-2P-2Q’e') 

= -Q,{T'e + 2e,T). 

Let us, as before, take the xz plane parallel to the incident 
rays, so that (15-23) hold. Then (15-32) become 

fx + Py + Qy^ - <r(H- |e/<- 2 ) (z^-i — 2P- 2Q'(r'^) 
(15-33) ■ = -e,0-'(3or2 + T2), 

y-T(l + |e/f-2)(z/t-i-2P-2eV2) = -2Qy(TT. 
Let us examine the intersections of these rays with the plane 
(15-34) s = 2/i(P + $V2): 

by (15-20) this is the plane on which a point image would be 
formed if Q, = 0. Shifting the origin to the point 0 with co- 
ordinates 

(15-35) x = -Py — Qy^, y = 0, 2 = 0, 

and denoting the new x by x, we see that the ray with components 
(T, T cuts the plane (15-34) at the point 

(15-36) X = —Qy{3(r^ + T^), y = —2Qy<TT. 
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Let us introduce polar angles 0, so that 
(15-37) (T — coH<j>, t = sin sin 

Then the intersection is, to the third order. 


(15-38) 


'x — — Q^(r'fiW^{'i cos^ ^ + sin^ ^) 

= — Q,(T'fiW\2 + cos 20), 
y = — 2Q,o-'/i‘^6/2sin0 cos0 = —Q^cr'fiW^sm2(j). 


Those rays for which d is constant cut the plane (15-34) in the 
circle 


(15-39) {x + 2QyixW^)^ + y^ = 

T’he centre of this circle is at 

X = — 'iQyyW, y = 0, 4 

and its radius is | Q,cr'yW^\. Fig. 31 shows the projection of this 
circle on the plane z = 0. It is evident that the tangents drawn 
from the origin O make angles of 30° with the ^--axis. 



(The figure is drawn for P^cr' < 0, Q^(r' < 0.) Letting 0 vary, we 
see that the illuminated 'portion of the plane (15-34) is a 'wedge or 
flare of angle 60°, having its vertex at the point whose coordinates are 
(15-40) X = — y = 0, z = 2fi{P + Q'a''^). 

This phenomenon is called circular coma. (Cf. § 11 for general, or 
elliptical, coma.) Since in (15-35) P^a' is much greater than 
we see that the vertex of the wedge points towards the 
axis of the instrument (as in Fig. 31) or away from it, according 
as P,Q, is positive or negative. 


6-2 
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We have seen that a mirror, to be free from spherical aberration, 
must be paraboloidal, so that in (IS- 8) S — ao. We have then 
from (13*11) 

(16*41) P, = Q„ = 0, = 

Thus the paraboloidal mirror suffers from coma, the vertex of the 
wedge pointing towards the axis, as in Fig. 31. Putting cr' = d' 
in (15*39), we see that if the incident rays are inclined at an angle 
O' to the axis, the final rays inclined at an angle 6 to the axis cut 
the plane in a circle of radius \Rd'd‘^, the centre of the circle 
being at a distance \Rd'd^ from the vertex of the wedge. The 
vertex of the wedge is at a distance approximately ^Rd' from the 
axis of the instrument. {R is the radius of curvature of the 
paraboloidal mirror at its vertex.) 


Curvature of the image. 

Let us suppose that an instrument is corrected for spherical 
aberration, astigmatism and coma, so that a point image is 
formed by any set of parallel incident rays. We have then 
(15*42) P = = 0, Q„ = 0, 

and by (15*20) the point image is formed at 


(16*43) 


'x = -Py-Q:e'<r', 
.y = -Pj'-Q>e'T', 
z = 2fiP+2/iQ'e'. 


Suppose now that there is an extended distant object, such as a 
planet, which we regard as lying at infinity. From each point of 
it there comes a family of parallel rays (cr', t'), forming an image- 
point as given by (15*43). Varying cr', r', we get an image-surface: 
this surface will in general be curved, so that a curved photo- 
graphic plate would be necessary to obtain a sharp representa- 
tion of the whole object. 

The equation of the image-surface is approximately 
(15*44) 2 = 2fiP-\-2fiQ'(x^ + y^)IP^,: 

thus the radium of curvature p of the image-surface (counted positive 
when that surface has its concavity on the side 2 = + oo) is 
(15*45) p = P?/(4/*g'). 
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The condition for aflat image {no curvature) is therefore 
(16-46) Q' = 0. 

In the case of an instrument Avith coma (conditions 15-31), 
there is not a point image, but the coordinates (15-40) for the 
vertex of the coma-flare are the same as (15-43) with t' = 0. 
If we regard the vertex of the flare as an image, we may speak 
of an image-surface and its curvature. For a paraboloidal 
mirror, as in (13-11), 

( 15 - 47 ) p = IR- 

the radius of curvature of the image is half that of the mirror. 


Distortion. 


Let us now suppose that the instrument is corrected for 
spherical aberration, astigmatism, coma and curvature, so that 

(15-48) P = 4.Qp,\ Q, = 0, Q„ = 0, Q' = 0. 

Then, by (16-43), the image point corresponding to a', t' is at 


(15-49) 


'x = -P,<T'-Q',e'<T\ 
■y = -P,r'-Q:e'7', 
z = 2/tP. 


Although a plane image of an extended object is formed, it may 
not be perfect; distortion may be present. 

Let us first suppose that the object is plane, the plane being 
perpendicular to the axis of the instrument. Let the instrument 
be removed, and replaced by a screen perpendicular to the z-axis, 
the screen having an infinitesimal hole on the axis. This arrange- 
ment constitutes a “pin-hole camera”, and an image of the 
object plane will be formed on any plane perpendicular to the 
axis behind the hole. Any pattern drawn on the object plane will 
be reproduced to scale on the image plane, all lengths being 
enlarged or reduced in the same ratio. This is an image without 
distortion. 

If the object is not plane, we shall define an image without 
distortion as one formed in this way by projection through a 
point on the axis. The image formed by an optical instrument 
will be, by definition, free from distortion when it is a reproduct ion 
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to scale of the image that would be formed by projection through 
a point. 

Suppose then that there is an object point at infinity, the 
components of the rays from it being cr', t'. Taking projection 
through the origin on to a plane z = 1, the ray cr', t' gives the 
point 

(15-50) X = cr'lv', y = t' jv' . 

Denoting these coordinates by x, y, and using 

(15-51) o-'2 + T'2 + t;'2 = 

we see that to the third order of small quantities 


(15-52) 


ex' = 7i'i.i’x[\-l{x^ + y'^)\, 
t' = ri'ii'y[\-l{x^ + y^)], 


where y' = ± \ according as the incident rays are in the positive 
or negative sense. By (15-49) and (15-52) the corresponding 
image-point /ormed by the instrument is at 

(15-31 f* “ “ wyvp, - ih. 


The condition for a reproduction to scale, i.e. the exmdition for no 
distortion, is obviously 

(15-54) 2(2>'2 = P,. 

If this condition is not satisfied, the straight fine y = const, in 
the image by projection does not correspond to a straight line 
in the image formed by the instrument, but to a parabolic arc 
which curves away from ^ = 0 if 

(15-55) 2Q'/i'^/F,-l>0, 

and toward y = 0 if 

(15-56) 2^y2/P,-l<0. 

Also the straight line x = const, corresponds to a parabolic arc, 
curving away from or towards x = 0 according as (15-55) or 
(15-56) holds. 

The distortion corresponding to (15-55) is called cushion 
distortion and that corresponding to (15-56) is called barrel 
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distortion, the names being suggested by the patterns corre- 
sponding to a rectangular grid of Unes in the xy plane (Fig. 32). 



Fig. 32 a. Cushion distortion 



Fig. 326. Barrel distortion 


Collecting our results, we see that the conditions for the absence 
of spherical aberration, astigmatism, coma, curvature and dis- 
tortion are 

(15-57) P = ^Qfi\ = = = = 

so that the T’-function for such an instrument, perfect (to the 
order considered) for parallel incident rays, is 

(15-58) T = P'e' + Q''e'^+Q',e,{2n'^ + e') + Qe{4.ij,^ + e), 
the four remaining constants being arbitrary. 


16. The sine condition of Abbe. 

The preceding theory is approximate, dealing with rays ad- 
jacent to the axis of the instrument. We proceed to establish a 
condition which must be satisfied no matter what the inclination 
of the rays to the axis may be, if the instrument fulfils certain 
conditions of imagery. 
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Let 8' be a plane object perpendicular to the axis of the in- 
strument at Let us suppose that all rays from each point 
of 8' pass accurately through a single point in the image-space, 
so that an image-surface is formed, and that this image-surface 
of 8' is a plane 8 perpendicular to the axis at A. 

Let W(x',y',z',(r,T) be the IT-function of the instrument. 
Then by ( 6 - 10 ), ( 6 - 11 ) we have 
(16-1) <r' = -W,., T' = -W,,, 

(16-2) x-z(tIv = -W^, y-zTjv = -W,. 

Given x', y' , z' , cr, t, the equations (16-1) determine tr', t ': (16’2) 
are the equations of the final ray with components tr, t, v. Let 
y', z' be any point on 8' and x, y, z the corresponding image- 
point on 8. The origin is arbitrary; let us choose it at .4, so that 
2 = 0 and (16-2) read 

(16-3) x = y = -W,. 

Let us regard x', y' , cr, t as independent variables and x, y, cr', t' 
as functions of them. Then by our assumption as to the nature of 
the image, x, y are functions of x', y' only, independent of cr, t. 

By partial differentiation of (16-1), (16’3) we have 
(16-4) 
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Let the subscript 0 mean x' = y' = 0. Then from the symmetry 
of the instrument 




w being the magnification. Hence 
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Therefore 

(16-8) {(r')Q = m(r-\-a, {t'\ = mr -\-b, 

where a, b are constants. But ((t')q = 0 if o' = 0 and (t')o = 0 if 
T = 0. Hence a = b = 0, and so for corresponding rays through 
A', A we have 

(16*9) O'' = mcr, t' = mr. 

Thus if an exact plane image S of the plane 8' is formed, the in- 
clinations 6', d to the axis of the instrument of corresponding rays 
through A' and A must satisfy 

( 1 6' 10) /d sin 6' = m/i sin 6, 

fi' , n being the refractive indices of the initial and final media. This 
is known as the sine condition of Abbe. 

This result may also be established by means of the point- 
characteristie V, perhaps more directly. From the assumed 
property of exact imagery and Fermat’s principle, all the rays 
joining an assigned point on the plane through A' to its image 
have the same optical length V, which is a function of x', y' only. 
By symmetry V must be a maximum or minimum when 
x' — y' — 0, and hence SV = 0 for any infinitesimal displacement 
off the axis at A'. Therefore, by (5-8), 

(16-11) SV = o’Sx + rSy — a' Sx' — t' 8y' = 0, 
where o'', t' are the components of any ray through A' and o', t 
the components of the corresponding final ray through A; Sx', 
Sy' is an arbitrary displacement in the plane at A' and Sx, Sy the 
corresponding displacement at A. Combining (16-11) with (16-6), 
we deduce (16-9), and hence the sine condition (16-10) follows. 
This method is immediately applicable to the more general case 
in which S', 8 are not planes, but surfaces of revolution about 
the axis of the instrument: in that case also (16-10) holds. 

17. Calculation of T for a thin system. 

Consider an instrument of revolution (Fig. 34) in which surfaces 
8i{i= 1, 2, ...,n) separate media of refractive indices /ffl, /<i, 

Let the equation of Si be approximately 

(17-1) z = Vi + {rfx^ + y 2 ) + ^Siix^ + y^f. 
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where v^, are constants, being the reciprocal of the radius 
of curvature, counted positive when the surface is concave 
towards 2 = + oo. The notation has been slightly altered from that 
of { 1 2- 1 ) : we note that the condition for a spherical surface is now 
(17-2) = Jr?. 



A considerable mathematical simplification results from as- 
suming Vi = 0 {i—1,2, although it is physically impossible 

to bring the vertices of the surfaces into coincidence without 
breaking the refracting material, we shall assume that this 
condition is satisfied. The system so obtained is called a thin 
system in a technical sense. The behaviour of an actual instrument 
will approach more and more closely to that of a thin system the 
smaller the distances between the vertices. 

We shall denote the components in the several media by 

^0? ••• ’ '^ 11 ' 

For any quantity \Jr we shall write 
(17-3) A^xjr 

Thus, for example, is the increment in refractive index on 
crossing the surface 

With the necessary changes in notation, (12-22) gives for the 
T-function for the media separated by iS^, to the fourth order, 
(17-4) + 

X {1 + + 

- [(^i (tY + 7-)2]}. 
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(17-6) 


This may be written 

= T^n-i.i+ n\i+Tn.i, 
nu.i = mcrr+(A,Tn 

\Ti%l = K"(^i/*)-"[(^iO-)" + (-4iT)2]zl,|>-M(r2 + T2)l, 

The T-function for the complete instrument is 

T = T"^*\ 


(17-6) 


T(2) ^ 

i=l 

y/(4) ^ I 7T/(4) . 
i=l 

T"(4)= f 

4=1 


The intermediate components Uf, T; 1, 2, — 1) are to be 

eliminated, in accordance with the approximate method justified 
in § 13, by means of 


(17-7) 

Now 


doT; 


= 0, 


0T<2) _ 




(17-8) 

T(2) = j 2 {{(Tj - (r^_i)2 + (t,- - 

i=i 

' 9T(2) 

V 


Hence 


(17-9) 


(AiZ- = _A±l^_ = c 

jriAift ri+iA^^_^/i 

I _ ^ 4+1^ _ 

l^4^4/* ^4+1^4+li“ 


where G, D are independent of i. These fractions are invariants. 
Let us put 

(17-10) Fi^ = ^TjAjH, (i= 1,2, 

i=i 
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Then by (17-9) 

(17-11) 


(i — 1, 


= 2^/0- = G = GFi ^ 

i=i i=i 

I 7 -f-To = ^A^r = I)'^ r^AfU = 

I 1=1 j=i 

and SO 

(17-12) <r^-(T^ = CF-\ T^-r^ = DF-\ 

where we define 

(17-13) F-^ = F-^= ir^V- 

i=l 

By (17-11), (17-12) we have 

\TT-T-rfFF-^ 

( ^0 V^n. ^o) i J 

These eqvations give the intermediate components in terms of the 
initial and final components. 

By (17-9), (17-12) we have 

(17-15) (Zl^(r)2 + (J^T)2 = (riZlj/t)2i?’2{((r„-(ro)2 + (T,,-To)2}. 


Hence by (17-8), (17-13) we have for the T-function for the complete 
instrument to the second order 

(17-16) T<2) ^ ii?’{((r„-o-o)2 + (T„-T„)2}. 

By (17-5), (17-9), (17-12) 

117-171 f "" \nF%(rn-(roY + {rn-roY}^i<}>, 

Now 

(17-18) 

f 

n 

2 riA^ft = ri{<f>i-f>o) + r2{f>2-(fi) + ...+rJ^„-(f„_j), 

i==l 

n — 1 

= -ri<l>o + rnf>n- 1 
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Hence 

(17-19) rw = i nXi = + 

i=l 

n-1 
1 = 1 

in which the 0’s are to be replaced by the following expressions, 
obtained on substitution from (17-14) in the definitions (17-17): 

(17-20) 

(<l>i = + («■„ - cTo) FFi^f + [To + (r„ - To) FFr^f}, 

I (t = 1, 2, . • Tfc 1), 

l?^o = (O’o + ^o). + T’n)- 

yA'ws (17-19) expresses as a function of the initial and final 
components. 

Lastly, by (17-5), (17-9), (17-12), 

(17-21) rw = + 

i=\ 

Collecting our results and writing o-', r' for' the initial com- 
ponents and cr, t for the final components, we. have in the notation 
of (13-3) for the T -function of any thin system of revolution at the 
origin, to the fourth order inclusive, 

(17-22) 

' y=:y(2)+'p(« 

T<2) = -2e,+e), 

T(4) ^ jjiiT 2 (g/ _ 2 e^ -t-e) J 

< i-=l 

X {e'(l - FFi^f + 2e,FFr\l - FFi^) eF‘^Fz^]\ 

-lF\e'-2e, + ef'2«i^ih>'- 

i=l 


If we write as usual 

(17-23) T = P'e' + P,e,-\-Pe+Q''e'^ 

+ Q„e^, + + Q'^'^ + 
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we have, by comparison of coefficients, 

(17-24) 

P' = P = \F, P, = -F, 

i=l 

Q, = -iF\r^ii-^ + "^ FFi\\-2FF:f^)lii^A,^^r] 

£-1 

+ F‘^ 

i=l 

{Ff^ = X F= FJ, 

j=i 

If (ifi — l^n— have by (14-11) for the focal points 

(17-25) z' = -2P' = -F, z = 2P = F, 

while the nodal and principal points lie at 2 = 0 ; the focal length 
of the instrument is by (14-28) 

(17-26) F = (XriAac)-^. 

i—l 

F~^ is called the power. If we define the power of the pair of 
media separated by Si to be TiA^h, then the power of the whole 
instrument is the sum of the powers of the consecutive pairs of 
media. 

We note that T contains the factor e' — 2e, +e: hence we may 
write 

(17-27) = (e' — 2e, +e) {Ae' + Be, + Ce), 

and we shall have 

(.7.28, = 

' ' \Q', = B-iA, Q' = C + A. Q, = -2C + B. 

Of the Q's only three are independent: we have in fact 

(17-29) Q',=-\Q„-2Q'', Q, = -2Q-IQ„. 

We recall from (15-11), (15-16) that the conditions for the 
formation of point images are 

(17-30) P = ^Qfil Q, = 0, Q„ = 0. 
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Since P = \F # 0, if the focal length is not to vanish, the first of 
(17*30) demands and hence (17*30) are incompatible with 

the last of (17*29). Thus it is impossible to correct a thin instru- 
ment simultaneously for spherical aberration, astigmatism and 
coma. 

18. Aberrations of a thin lens. 

Let us now consider a thin lens in vacuo, bounded by the 
spherical surfaces 

(18*1) + + + 

j/Sa.- z = lr^{x^-^y^) + \s^{x^->ty‘^f, s^ = \r\-, 

r^, r^ are the reciprocals of the radii. The figure shows the case of 



Fig. 35 


a convex lens, r^ >0, r^< 0, but the argument to be developed 
applies to all signs. 

We are to put in the results of § 17 

(18*2) /*„ = = 1, /<i = /b = fi—\, A^p=\—ii, 

H being the refractive index of the lens. For the focal length F we 
have 

(18*3) F-i = riJj/f + rada// = {p-\){r^-rf), 
and so by (17*24) we have 

(18*4) P' = P = - |P, = |P = h{p-irHri-rf)-K 
Also 
(18*5) 


Pfi = 
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Application of (17’24) with n = 2 gives 

= kF\r^ + /i- Vf (ri - rg)-! 

- i(/t - l)-i {r\ + r-i ra + r|) {r^ - ra)"^], 
(18-6) ig, = -\F\r^ + FF:[\l-2FFy^)H-\r^-r^)\ 

+ J’«(5i-Sa) (/i-1) 

= -\F\r^+ /t- Vi(ri + ra) {r^ - r^)-^ 

[ 1)~^ (^1 + ^’i ^2 + rl) (ri - ra)-!]. 

Hence 

(18-7) 4g-P = P2[y^-|(/i-l)(/-i-ra)+/*“Vf(ri-r2)-i 

- ii/^ - 1)~^ (^1 + ^1^2 + 4) K - >”2)"^]- 

Thus by (IS* 11) the condition for the absence of spherical aberra- 
tion is 

(18-8) (4g - P) (ri - ra) P-2 = r^ir^ - ^ 2 ) - - 1 ) (»-i - r^Y 

+ fi-h\ - I in - 1 )-i (rf + r-j ra + r|) = 0. 

Given /i, this is a quadratic equation for the ratio rj/ra- However, 
since n>l necessarily, the roots prove to be imaginary, so that 
it is impossible to avoid spherical aberration for a single thin lens. 
We have also from (18*6) and (17-29) 

(18 9> \*Q + Q. = i^‘\T^+r'rA 

' {iQ-Q.^mQ+Q.), 

SO that g,, Q„ are easily calculated when Q has been found from 
(18-6). 

19. Chromatic aberrations. 

In the preceding work we have treated the refractive index of 
a medium as a constant. Actually it depends on the colour or 
frequency of the light employed. Thus our investigations up to 
this point must (except in the case of reflections) be regarded as 
applying to light of a single colour (monochromatic light). The 
phenomena arising from variation of refractive index with colour 
are known as phenomena of dispersion. 
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Since the T-function of an instrument depends on the refrac- 
tive indices of the media involved, it will depend on the colour 
of the light. Thus we should write 

(19*1) T==T{cr\T\(r,T,xh 

where x is some number which specifies the colour of the light. 
We might take for x wave-length of the light in vacuo, or its 
frequency. 

In an instrument involving refiections only, dispersion is 
entirely absent. This is obvious from the fact that the law of 
reflection does not depend on the refractive index, but it may also 
be seen by consideration of the T-function. If the reflections take 
place in vacuo, T will be independent of x* if they take place in 
a medium of index [i, T will be of the form /iT\ where T' is a 
function of direction cosines only, and the factor /i will disappear 
in the equations of the rays. 

In the instrument of revolution, for which T has the approxi- 
mate form (13*6), (13-7), the coefficients P' , P,, P, Q", ... will 
depend on x- Hence we arc not to expect that the absence of 
spherical aberration, for example, for one colour will imply its 
absence for other colours. More serious than this, however, the 
dependence oi T on x niakes itself felt even in the approximate 
theory based on the second-order terms in T, and leads to 
chromatic aberrations much more important than those arising 
from the dependence of the Q'h on x- first-order imagery of 
§ 14 will in general be different for different colours, because the 
quantities ^p^ ^p 

(and hence the cardinal points) will be different. We shall confine 
our attention to instruments in vacuo, so that = 1; the 

quantities determining the cardinal points are then P', P,, P. 

We cannot design a refracting instrument to make P', P,, P 
independent of x- But, with sufficient parameters (viz. refractive 
indices, curvatures and positions of surfaces) at our disposal, we 
can make these quantities take the same values for specified 
values of %, i.e. we can eliminate chromatic aberration for 
specified colours. Actually it is usual in practice to limit the 


SGO 


7 
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correction to two values of x, say Xv X 2 (corresponding to the G 
and F lines of the sodium spectrum). Using to indicate an 
increment on passing from Xi to X 2 > conditions for achro- 
matism for these two colours are 

(19-2) = = A^P = 0. 

For a thin combination in vacuo these reduce by (17-24) to 
the single condition 

(19-3) “ A^F-^ = A^{^uAifi) = 0, 

or 

(19-4) = 

i=l 

For a single thin lens with refractive index fi, this condition 
reads 

(19-5) {r^-r^)A^n = a, 

which cannot be satisfied except by the trivial solution = rg. 
For a thin double lens with curvatures r^, r^, and refractive 
indices the condition for achromatism is 

(19-6) {ri-r2)A^/ii + ir3-ri)A^/i3 = 0. 

The dispersive power of a medium is conventionally defined as 

(19-7) ^ = 

p— L 

where p, is the refractive index for some colour fixed conven- 
tionally (the sodium D-line). Thus (19*6) reads 

(19‘8) Dpr^ — r^{pi—\)+D2,{r^ — r^[p^—l) = 0, 

or 

(19-9) Z>i/Fi-f£>3/F3 = 0, 

where D^, are the dispersive powers of the lenses and F^, F^ 
their focal lengths for the colour corresponding to the index p. 

It is easily seen that for a general thin system of lenses, the 
condition for achromatism is 

(19-10) 2’(/>/F) = 0, 

where the summation extends over all the lenses, D and F being 
respectively the dispersive power and the focal length of a lens. 
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HETEROGENEOUS ISOTROPIC MEDIA 

20. Fermat’s principle. 

The media previously considered were homogeneom and iso- 
tropic, in the sense that the optical properties were the same at 
all points (homogeneity) and the same for all directions at each 
point (isotropy). The most general medium is heterogeneous and 
anisotropic, but we shall confine our attention to media which 
are heterogeneous and isotropic. 

It is assumed that in a heterogeneous isotropic medium there 
is a velocity of propagation at each point, in general variable from 
point to point: we may write it 

(20-1) v = v(x,y,z), 

X, y, z being rectangular Cartesian coordinates. The function v 
will also depend on the colour of the hght, but it wiU be unneces- 
sary to indicate this dependence. The refractive index is defined as 

(20-2) p = cjv, 

c being the velocity of light in vacuo-, p varies from point to point, 
and so the heterogeneous isotropic medium may be called a 
medium of variable refractive index. It is assumed that in each 
medium p is continuous and possesses continuous partial 
derivatives. 

If we draw any curve G, joining points A' and A, a point 
moving from A' to A along G and having in each position the 
assigned velocity v, will pass from A' to A in time 

rA rA 

(20*3) t = I dsjv = I pds. 

CA 

We define the optical length of (7 to be I pds, as in § 2. 

J A' 

We shall accept as a basis for our theory Fermat's principle in 
the following form: the actual ray along which light travels from 
A' to A has a stationary optical length when compared with adjacent 


7-2 
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curves joining A' and A. This is, of course, equivalent to saying 
that the time taken by the hght to travel from to ^ has a 
stationary value. 

We shall now find the differential equations of the rays in a 
single medium. For any curve 6’ joining A' and A with equations 


(20*4) X = x{u), y = y{u), z = «(«), 

u being any parameter, the optical length is 


(20-5) 


= / 
J Ui 


li{x, y, z) {x^ + y^ + 2 ^) j du, 


where u = u^a/t A' and u = u^a,t A and x = dxjdu, y = dyjdu, 
z = dz/du. This may be written 

'Ut 

vodu, 


( 20 - 6 ) 


(•tt, 

= 1 
J Ml 


\w = y{x, y, z) (i2 + ?/2 + 


w being a function of x, y, z, x, y, z. Let us take a set of adjacent 
curves, each described by equations of the form (20-4), the para- 
meter u running between the same terminal values on all the 
curves. Then the variation of L on passing from one of these 
curves to its neighbour is 

J 'u, 

8wdu 

Ml 

Z indicating a sum of terms obtained by changing x ^ y z, 
and 8x, 8y, 8z, 8x, 8y, 8z being infinitesimal increments obtained 
in passing from a point on one curve to the point on its neighbour 
with the same value of u. Then 


( 20 - 8 ) 


8x — ^ 8x', 
du 




8z = 4-8z, 
du 


and hence integration by parts gives 


Bw “I"* r“« / 

(20-9) 8L= S^8x - 4 

L 9a: \i 


d Bw 
du Bx 


9w\ 


8xdu. 



Fermat’s principle 


101 


The first part vanishes if the curves have common end-points, 
for then dx = 8y = 8z = 0 Sbt those points. If the curve G is the 
natural ray from .4' to .4, the remaining integral must vanish for 
values of 8x, 8y, 8z arbitrary along G save for the condition of 
vanishing at A ' and A . Hence it follows that, by virtue of Fermat's 
principle, a ray satisfies the differential equations 


( 20 - 10 ) 


d dw dw 
du dx dx 
d dw dw 
du dy dy 
d dw dw 
du dz dz 


these being in fact Euler’s equations for the extremals of J ri; du. 
Substituting for w from (20-6) we have 


(20-11) 


d dw 
du dx 


dw 

dx 


dr yx 
du \_{x^ +y^ + 2^)*. 


dy 

dx 


(x^ -f- Ip -f 2^)*, 


and similar forms for the other two expressions in (20-10). 

The parameter u along the ray G is still arbitrary. If we put 
u — s, the arc length of G, wo have 

( 20 - 12 ) x^ + y^ + z^—\ 

along G. Hence, by (20-11), the equations (20-10) become 


(20-13) 



d I d 2 \ dy _ 
IdsV ds/ dz 


If we take for u a parameter defined by 


(20-14) 
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we have du = dajfi, and so the equations of a ray may be written 


(20-15) 




Thvs we have in (20-10), (20-13) and (20-15) three different forms 
for the equations of the rays. 

Denoting by i the unit tangent vector to the ray, so that, by 
the first Frenet formula, 

(20-16) dilds==ilp, 

where j is the unit principal normal, drawn to the concave side of 
the projection of the ray on its osculating plane, and p is the 
radius of curvature (always positive), the equations (20-13) read 
in vector form 

(20-17) ^ (/ii) = grad/i, 

or 

(20-18) ^i+/ij/p = grad/*. 


Thus the gradient of the refractive index lies in the osculating plane 
of the ray. Also, operating on (20-18) with j., we get 

(20-19) /ijp = j.grad/* = dpjdn, p~^ = 0(log/*)/3w, 
where d/'dn indicates differentiation along the principal normal, 
dn being an element of length of this 
normal. Since p is positive by defini- 
tion, dfijdn is positive. Thus the 
refractive index increases as we go 
along the principal normal, or, in 
other words, the ray bends toward the 
region of higher refractive index 
(Fig. 36). 

Let us consider the case where the medium consists of parallel 
planes of equal refractive index. Taking the z-axis perpendicular 
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to these planes, we have /t = fi(z). Then integration of the first 
two of (20- 13) gives 

(20’20) iidxjds = a, fidyjds = b, 

where a and b are constants. Hence 

(20-21) {dzjdsf = 1 - {dxjdsf - {dyjdsf = 1 - (a^ + 

We note that 

(20-22) dyjdx = bja, 

and hence the projection of each ray onz = const, is a straight line, 
as indeed we might expect from symmetry. 

From (20-20), (20-21) we have 

(20-23) {dzjdx)^ = (/i^ — a^ — b^)ja^, {dzjdyY = {/i^ — a^ — b^)jb^, 
which give x and y as functions of z by quadrature. Hence 
we have for a ray in a stratified medium p — p(z) the integrated 
equations 

(20-24) = ^ 

the ambiguous signs corresponding to those occurring when the 
roots of (20-23) are taken. To avoid confusion arising from these 
ambiguities and to get a general idea of the behaviour of the rays, 
we may proceed as follows. Since the left-hand sides in (20-23) 
cannot be negative, the ray cannot leave the region for which 

(20-25) («•-* + 62)1. 

The constants a and b are determined by the initial point and 
direction of the ray, and the right-hand side of (20-25) has a 
simple meaning. If 6 denotes the inclination of the ray to the 
2-axis, we have in general 

(20-26) a^ + b'^ = p\{dxjds)^ + {dylds)'^'\ — p^sin^d, 

and so (20-25) may be written 

(20-27) p — p’ smO' '^0, 

the accents denoting initial values. The medium may be divided 
into layers in which the sign of the left-hand side of (20-27) is 
alternately positive and negative. The initial point lies in a layer 
for which this quantity is positive, and the ray cannot leave this 
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layer, which is bounded by planes z = z^, z = z^, satisfying the 
equation 

(20'28) [i = fi' miO'. 

The ray is a periodic plane curve oscillatitig between these planes 
and touching them, the increments in x and y between successive 
contacts being respectively 

(20-29) a r — -A , 6 r* - - . 

The layer containing the ray may, of course, extend to infinity 
above or below, in which case the modifications in the argument 
are obvious. 


Let us now consider the case where the refractive index has 
spherical symmetry with respect to a point O. This corresponds 
apijroximately to the case of refraction in the earth’s atmosphere, 
when the curvature of the earth is taken into account. Let r 
denote the position vector of a point in the medium, relative to 0, 
so that [i = /i{r). Then 


(20-30) 


. _ dr 

^~ds' 


grad/i = 


r 


dfi 

rdr' 


Now, by (20-17), we have along a ray 
(20-31) 


d , ..dr . d , 


= r X grad/t 

= 0 . 


Thus /ir X i is a constant vector, which shows that the ray lies in 
a plane through O, and further fir simp = const., where is the 
angle between the radius vector and the ray. The analogy to the 
dynamical theory of orbits under central forces is obvious. This 
relation may also be written fip = const., where p is the per- 
pendicular dropped from 0 on the tangent to the ray. 

Returning to the general heterogeneous medium, let us denote 
by a, /?, y the d irection cosines of a ray, and let us define the 
components cr, t, y of a ray by the equations 

(20-32) cr = fioi, t = fi^, v = fiy. 
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Then the equations (20-13) for a ray may be written 


(20-33) 


d(T _dii dr _ d(i dv _ 0/t 
ds dx' ds dy’ ds dz ' 


We may note that in the case where is a function of z only, the 
result (20-20) may be written a = const., t = const. 

The law of refraction when a ray passes from one medium to 
another across a surface of discontinuity of y may be deduced 
easily from (20-9). (We might also deduce it from (20-33), by 
proceeding to a hmit in wliich the gradient of the refractive index 



tends to infinity.) Let A' BA (Fig. 37) be a natural ray, crossing 
a surface of discontinuity at B, and let A'CA be an adjacent 
broken curve. In (20-9), which is a general formula for variation 
of optical length in a single medium, the integral vanishes if the 
curve from which the variation is made is a natural ray; the 
formula then reads, if s be taken for parameter, 

r I”* r 

(20-34) ^a:J = j^2’cr(Ja;J . 

Applying this formula first to the variation from A'B to A'C, 
and secondly to the variation from BA to CA, we have for 
differences in optical lengths 

(20-35) [A'C] - [A'B] = Icrjx, 

[GA]-[BA] = -Za-Jx, 

where cr^, t^, are the components of the ray just after refrac- 
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tion, cTg, Tg, ^2 components just before, and Sx, 8y, 8z the com- 
ponents of the displacement BC. Consequently (20*34) gives 

(20*36) [A'CA\-[A'BA\ = -ZA(r8x, 

where Aa, At, Av are the increments on crossing the surface. 
Hence, by Fermat’s principle, 

(20*37) 2:A(T8x = (i 

for an arbitrary infinitesimal displacement on the surface, and so 

(20*38) Acrjl — Arlm = Av/n, 

where I, m, n are the direction cosines of the normal to the surface, 
as in (2*17). The same formulae hold for reflection. Thv^ (20*38) 
is the law of reflection or of refraction at a surface of discontinuity 
of the refractive index between two heterogeneous isotropic media. 


21. The characteristic function F. 


Let us now introduce the characteristic function V for a hetero-. 
geneous isotropic medium. Let A'(x' ,y' ,z') and A{x,y,z) be 
two arbitrarily selected points in the medium: the characteristic 
function 

(21*1) V = V{x',y',z',x,y,z) 


is defined to be the optical length of the natural ray A' A. Let us 
seek an expression for the infinitesimal change in V due to arbi- 
trary infinitesimal displacements of A' and A to B' and B 
respectively. If is a parameter running between the same 
terminal values on the varied and unvaried rays, the variation 
in the optical length, that is, 8V, is given by (20*9). Since the 
unvaried curve is a natural ray, the integral vanishes, and if we 
take u = s, the arc-length of the ray, we have 


/ni o 

( 21 - 2 ) ^ = = = 

Hence, using accents to denote initial values, we have 


(21*3) 8V = I:<t8x-E<t'8x', 



and sof 
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(21-4) 


0-' 

, 

dV 

dx' ~ ’ 

dy' 

dz' 

dv 

dV 

dV 

dx 

~dy 

dz 


Hence on account of the identities 


( 21 - 6 ) 
we have 

( 21 - 6 ) 



These two partial differential equations^ are satisfied by the 
characteristic function V. 

If we have a system of media, we define the characteristic 
function F for the system as the optical length of the natural ray 
joining a point A'{x',y',z') in the initial medium to a point 
A{x, y, z) in the final medium, l^et us vary A', A to B' , B respec- 
tively. By Fermat’s principle, the optical length of the natural 
ray B'B is equal (to the first order) to that of a curve C joining 
B' and B and coinciding with the ray A' A except in the initial 
and final media. Application of (20-9) to the terminal portions 
leads us at once to the expression (21-3) for SV, and henee to the 
equations (21-4) and (21'6), which consequently are true not only 
for a single medium, but also for a system of media separated by 
surfaces across which the refractive index is discontinuous. 


22. The construction of Huyghens. 

Let us now consider the construction of Huyghens. We imagine 
a wave-front 8 at time t (Fig. 38). To find the wave-front at time 
t + dt, we take elementary spheres having their centres on 8 with 

t It is assumed here that arbitrary independent variations may be given to A ' 
and A : cf. the footnote in connection with (5-9). 

J Cf. (5' 14). Either of the equations (21*6) may be regarded as the Hamilton- 
J acobi equation for a particle moving in a conservative field of force. 
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radii equal to the distance travelled by light in time dt, namely 

vdt. The new wave-front is the envelope of these spheres. 

Actually, the envelope consists of 

two sheets, one in front and one v)y\ 

behind S, but we suppose the sense \A7\ 

of propagation assigned, so that one \y\j\ 

of these sheets is ruled out. T \ ]\ 

The wave of which we are speak- 
ing reaches a given point x, z at ( 
some time t. This i is a function of ^ 

X, y, z, and so we can write the equa- 
tion of the wave-front in the form 

(22-1) ct = S{x,y,z). ( n y 

This equation describes the whole ^ 

history of the wave-front. The in- ^ / 

stantaneous position is given by ' 

taking t constant. 

It is evident that, given v{x, y, z), or equivalently fi{x, y, z), the 
construction of Huyghens gives a definite development for the 
wave. In this construction the rays are defined by the condition 
that the ray through a point P oi S passes through the point of 
contact of the adjacent envelope with the elementary sphere 
having its centre at P. It is obvious, then, that the ray is normal 
to the wave. (This is not necessarily the case for anisotropic 
media, the elementary waves not being spheres.) Accordingly 
if a, yd, y are the direction cosines of a ray, we have 


( 22 - 2 ) 




where d is a factor of proportionality. By (22’ 1 ) we have, moving 
with the wave. 


(22-3) 


dS = E^dx = \Sa, 




where ds is an element of the ray. But ds = vdt, y — cjv, and so 
jjt, = 0. Thus by (22-2) we have for the components of the ray, 
defined as in (20-32); 
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(22-4) 




dz ' 


Therefore S satisfies the partial differential equation 


(22-5) 





We shall now show that the rays, defined as above in terms of 
the construction of Huyghens, are in fact identical with the rays 
given by the principle of Fermat, each being determined by an 
initial point and direction. Along the ray as given by the con- 
struction of Huyghens we have by (22'4) 


( 22 - 6 ) 


d(T __ d d8 
ds ds dx 


dx‘‘‘ dx ox dz'^ 

^ \_dx^ dx ^ dy dx dy ^ dz dx dzj 



— S.n — 1 /y2 

- 2 /* 9 ^/* 


dy 

dx' 


But this is the first of the differential equations (20-33) satisfied 
by the rays given by the principle of Fermat, and the other two 
differential equations follow of course similai-ly. Thus we are able 
to reconcile completely the principle of Fermat and the construction 
of Huyghens in a heterogeneous isotropic medium. 

When we have to deal with reflection or refraction, it is easy 
to see that the construction of Huyghens gives the same law as 
Fermat’s principle, namely (20-38). 

We saw in § 3 that, in the case of homogeneous media, rays 
emanating from a point source form a normal rectihnear con- 
gruence after any number of reflections or refractions. In the 
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case of heterogeneous media, it follows at once from the con- 
struction of Huyghens that rays emanating from a point source 
form a normal congruence of curves after any number of reflections 
or refractions. This result also follows from the formula (21'3) for 
the variation of the characteristic function, which, as we have 
seen, holds not only for a single medium, but also for a system of 
media. For if A'(x', y', z') is a point source in the initial medium 
and A {x, y, z), B(x +8x,y + Sy, z + Sz) adjacent points in the final 
medium such that the optical lengths of the rays A'A, A'B have 
a common value V, then, since 

Sx' = dy' = Sz' = 0, 

and SV = 0, we have 

(22-7) laSx^O, 

which establishes the orthogonality of the ray A'A to the surface 
V = const. It is evident that the surfaces V = const, are in fact the 
successive positions of a wave. 

The function S which occurs in the interpretation of the con- 
struction of Huyghens is closely related to the characteristic 
function V. As we pass along a ray we have by (22’4) 

O O HQ 

(22-8) dS = 2:^dx = zl-ads = /ids. 

OX ox 

Thus the increment in S on passing from one wave to another is 
the optical length of a ray, measured from one wave to the other. 
Let be any wave. Given any point x, y, z, let the ray through it 
be drawn, cutting Sq at x',y' ,z'. Then 

(22-9) S(x, y, z) - Six', y' , z') = V(x', y', z' , x, y, z). 







